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Phytase variants 

FIELD OF THE INVENTION 

This invention relates to variants of phytases, in 
particular variants of ascomycete phytases and variants of 
5 basidiomycete phytases, the corresponding cloned DNA sequences, 
a method of producing such phytase variants, and the use thereof 
for a number of industrial applications. 

BACKGROUND OF THE INVENTION 

10 Phytic acid or myo-inositol 1, 2, 3, 4, 5, 6-hexakis dihydrogen 

phosphate (or for short myo-inositol hexakisphosphate) is the 
primary source of inositol and the primary storage form of 
phosphate in plant seeds. Phytin is a mixed potassium, magnesium 
and calcium salt of inositol. 

15 The phosphate moieties of phytic acid chelates divalent 

and trivalent cations such as metal ions, i.a. the nutritionally 
essential ions of calcium, iron, zinc and magnesium as well as 
the trace minerals manganese, copper and molybdenum. 

Phytic acid and its salts, phytates, are often not 

20 metabolized, i.e. neither the phosphorous thereof, nor the 
chelated metal ions are nutritionally available. 

Accordingly, food and feed preparations need to be 
supplemented with inorganic phosphate and often also the 
nutritionally essential ions such as iron and calcium, must be 

25 supplemented. 

Still further, the phytate phosphorus passes through the 
gastrointestinal tract of such animals and is excreted with the 
manure, resulting in an undesirable phosphate pollution of the 
environment resulting e.g. in eutrophication of the water 

30 environment and extensive growth of algae. 
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Phytic acid or phytates, said terms being, unless 
otherwise indicated, in the present context used synonymously or 
at random, are degradable by phytases. 

The production of phytases by plants as well as by 
5 microorganisms has been reported. Amongst the microorganisms, 
phytase producing bacteria as well as phytase producing fungi 
are known. 

There are several descriptions of phytase producing 
filamentous fungi belonging to the fungal phylum of Ascomycota 

10 (ascomycetes) . In particular, there are several references to 
phytase producing ascomycetes of the Aspergillus genus such as 
Aspergillus terreus (Yamada et al., 1986, Agric. Biol. Chem. 
322:1275-1282). Also, the cloning and expression of the phytase 
gene from Aspergillus niger var. awamori has been described 

15 (Piddington et al., 1993, Gene 133:55-62). EP 0420358 describes 
the cloning and expression of a phytase of Aspergillus ficuum 
(niger) . EP 0684313 describes the cloning and expression of 
phytases of the ascomycetes Aspergillus niger, Myceliophthora 
thermophila, Aspergillus terreus. Still further, some partial 

20 sequences of phytases of Aspergillus nidulans, Talaromyces 
thermophilus, Aspergillus fumigatus and another strain of 
Aspergillus terreus are given. 

The cloning and expression of a phytase of Thermomyces 
lanuginosus is described in WO 97/35017. 

25 There is a current need for phytases of amended properties 

or characteristics, e.g. phytases of increased thermostability, 
altered pH optimum (a high pH optimum being desirable for in- 
vitro processing, a low for in-vivo processing in the gastro- 
intestinal tract), and/or of a higher specific activity. 
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SUMMARY OF THE INVENTION 

In a first aspect, the invention provides phytase 
variants, the characteristics of which are amended - as compared 
to a so-called model phytase. 
5 Any model phytase, which is of a certain similarity to 

thirteen herein specifically disclosed model phytases, can be 
made the model of such variants. 

In another aspect, the invention relates to a novel 
phytase derived from Cladorrhinum f oecundissimum. 
10 In still another aspect, the invention provides DNA 

sequences encoding these phytase variants and this phytase, and 
methods of their production. 

Finally, the invention also relates generally to the use 
of the phytase and the phytase variants for liberating 
15 phosphorous from any phytase substrate, in particular inorganic 
phosphate from phytate or phytic acid. 

BRIEF DESCRIPTION OF THE DRAWINGS 

In the detailed description of the invention below, 
20 reference is made to the drawings, of which 



Fig. 1 is an alignment of thirteen specific phytase 

sequences (a multiple sequence alignment according 
to the program PileUp; GapWeight : 3.000; 
25 GapLengthWeight : 0.100); 

Fig. 2 this figure shows the amino acid and DNA sequence of 

a first phytase C x P_involtus-Al") derived from 
strain CBS 100231 of Paxillus involutus which was 
30 deposited on 28.11.97; the expression plasmid pYES 

2.0 comprising the full length cDNA sequence was 
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transformed into E . coli strain DSM 11842 which was 
deposited on 12,11.97 (see WO 98/28409); 

Fig. 3 this figure shows the amino acid and DNA sequence of 

5 a second phytase ("P_involtus-A2") derived from 

strain CBS 100231 of Paxillus involutus which was 
deposited on 28.11,97; the expression plasmid pYES 
2.0 comprising the full length cDNA sequence was 
transformed into E. coli strain DSM 1184 3 which was 
10 deposited on 12.11.97 (see WO 98/28409); 

Fig. 4 this figure shows the amino acid and DNA sequence of 

a phytase C x T_pubescens") derived from strain 
CBS 100232 of Trametes pubescens, which was 
15 deposited on 28.11.97; the expression plasmid pYES 

2.0 comprising the full length cDNA sequence was 
transformed into E. coli strain DSM 11844 which was 
deposited on 12.11.97 (see WO 98/28409); 

20 Fig. 5 this figure shows the amino acid and DNA sequence of 

a phytase ("A_pediades") derived from strain CBS 
900.96 of Agrocybe pediades deposited on 04.12.96; 
the expression plasmid pYES 2.0 comprising the full 
length cDNA sequence was transformed into E. coli 

25 strain DSM 11313 which was deposited on 02.12.96 

(see WO 98/28409) ; 

Fig- 6 this figure shows the amino acid and DNA sequence of 

a phytase ("P_lycii") derived from strain CBS 686.96 
30 of Peniophora lycii which was deposited on 04.12.96; 

the expression plasmid pYES 2.0 comprising the full 
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length cDNA sequence was transformed into E. coli 
strain DSM 11312 which was deposited on 02.12.96 
(see WO 98/28409) ; 



5 Fig. 7 this figure equals figure 2 of EP 0684313 and shows 

the amino acid and DNA sequence of a phytase 
("M_thermophila") derived from strain ATCC 4 8102 
(=ATCC 7 4 340) of Myceliophthora thermophila which 
was re-deposited on 14.03.97; 

10 

Fi< 3- 8 this figure shows the amino acid and DNA sequence of 

a phytase ("A_f umigatus") derived from strain ATCC 
13073 of Aspergillus fumigatus (see EP 0897985) ; 

15 Fig. 9 this figure shows the amino acid ("Conphys") and DNA 

sequence of an ascomycete consensus phytase (in the 
present context called "consphyA") (see EP 0897985); 

Fig. 10 this figure shows the amino acid and DNA sequence of 

20 a phytase ("A_nidulans") derived from strain 

DSM 9743 of Aspergillus nidulans (see EP 0897985); 

Fig. 11 this figure equals figure 8 of EP 0420358 and shows 

the amino acid and DNA sequence of a phytase 
25 ("A_f icuum") derived from Aspergillus ficuum strain 

NRRL-3135; 



Fig. 12 this figure equals figure 1 of EP 0684313 and shows 

the amino acid and DNA sequence of a phytase 
("A_terreus") derived from strain CBS 220.95 of 
Aspergillus terreus ; 
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Fig. 13 this figure shows the amino acid and DNA sequence of 

a phytase ("T^thermo") derived from strain ATCC 
20186 (=ATCC 74338) of Talaromyces thermophilus 
which was redeposited on 14.03.97 (see EP 0897985); 

Fig. 14 this figure equals figure 2 of WO 97/35017 and shows 

the amino acid and DNA sequence of a phytase 
("T_lanuginosa") derived from strain CBS 586.94 of 
Thermomyces lanuginosus; a plasmid comprising the 
full length cDNA sequence was transformed into 
E. coli DH5cc (pMWR4 6) strain B-21527 which was 
deposited with NRRL on 23.02.96; 

15 Fig. 15 this figure shows the amino acid and DNA sequence of 

a phytase ("C_f oecundissimum") derived from strain 
CBS 427.97 of Cladorrhinum f oecundissimum which was 
deposited on 23 January 1997; the expression plasmid 
pYES 2.0 comprising the full length cDNA sequence 

20 was transformed into E. coli strain DSM 12742 which 

was deposited on 17 March 1999; 

Fig. 16 this figure shows an alignment of the phytase 

C_foecundissimum with the model phytase 
25 M_thermophila, using the program GAP gcg (Gap Weight 

3.000; Length Weight 0.100); and 

Fig. 17 shows how the C_f oecundissimum phytase can be pasted 

onto the alignment of Fig. 1. 



30 
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DETAILED DISCLOSURE OF THE INVENTION 

Phytase 

In the present context a phytase is an enzyme which 
catalyzes the hydrolysis of phytate (myo-inositol 
5 hexakisphosphate) to (1) myo-inositol and/or (2) mono-, di-, 
tri-, tetra- and/or penta-phosphates thereof and (3) inorganic 
phosphate. In the following, for short, the above compounds are 
sometimes referred to as IP6, I, IP1, IP2, IP3, IP4, IPS and P, 
respectively. This means that by action of a phytase, IP6 is 

10 degraded into P + one or more of the components IP5, IP4, IP3, 
IP2, IP1 and I. Alternatively, myo-inositol carrying in total n 
phosphate groups attached to positions p, q, r, . . is denoted 
Ins (p, q, r, . . ) Pn. For convenience Ins (1, 2, 3, 4, 5, 6) P6 (phytic 
acid) is abbreviated PA. 

15 According to the Enzyme nomenclature database ExPASy (a 

repository of information relative to the nomenclature of 
enzymes primarily based on the recommendations of the 
Nomenclature Committee of the International Union of 
Biochemistry and Molecular Biology (IUBMB) describing each type 

20 of characterized enzyme for which an EC (Enzyme Commission) 
number has been provided) , two different types of phytases are 
known: A so-called 3-phytase (myo-inositol hexaphosphate 3- 
phosphohydrolase, EC 3.1.3.8) and a so-called 6-phytase (myo- 
inositol hexaphosphate 6-phosphohydrolase, EC 3.1.3.26). The 3- 

25 phytase hydrolyses first the ester bond at the D-3-position, 
whereas the 6-phytase hydrolyzes first the ester bond at the D- 
6- or L-6-position. 

The expression "phytase" or "polypeptide or enzyme 
exhibiting phytase activity" is intended to cover any enzyme 
30 capable of effecting the liberation of inorganic phosphate or 
phosphorous from various myo-inositol phosphates. Examples of 
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such myo-inositol phosphates (phytase substrates) are phytic 
acid and any salt thereof, e.g. sodium phytate or potassium 
phytate or mixed salts. Also any stereoisomer of the mono-, di-, 
tri-, tetra- or penta-phosphates of myo-inositol might serve as 
5 a phytase substrate. A preferred phytase substrate is phytic 
acid and salts thereof. 

In accordance with the above definition, the phytase 
activity can be determined using any assay in which one of these 
substrates is used. In the present context (unless otherwise 
10 specified) the phytase activity is determined in the unit of 
FYT, one FYT being the amount of enzyme that liberates 1 fimol 
inorganic ortho-phosphate per min. under the following 
conditions: pH 5.5; temperature 37 °C; substrate: sodium phytate 
(C 6 H 6 0 24 P 6 Na 12 ) in a concentration of 0.0050 mol/1. A suitable 
15 phytase assay is described in the experimental part. 

The present invention provides a genetically engineered 
phytase as described in the appending claims. 

A genetically engineered phytase is a non-naturally 
occuring phytase which is different from a model phytase, e.g. a 
20 wild-type phytase. Genetically engineered phytases include, but 
are not limited to, phytases prepared by site-directed 
mutagenesis, gene shuffling, random mutagenesis etc. 

The invention also provides DNA constructs, vectors, host 
cells, and methods of producing these genetically engineered 
25 phytases and phytase variants, as well as uses thereof. 

A phytase variant is a polypeptide or enzyme or a fragment 
thereof which exhibits phytase activity and which is amended as 
compared to a model phytase. 

Amended means altered by way of one or more amino acid or 
30 peptide substitutions, deletions, insertions and/or additions - 
in each case by, or of, one or more amino acids. Such 
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substitutions, deletions, insertions, additions can be achieved 
by any method known in the art, e.g. gene shuffling, random 
mutagenesis, site-directed mutagenesis etc. 

The model or parent phytase, from which the phytase 
5 variant is derived, can be any phytase, e.g. a wild-type phytase 
or a derivative, mutant or variant thereof, including allelic 
and species variants, as well as genetically engineered variants 
thereof, which e.g. can be prepared by site-directed 
mutagenesis, random mutagenesis, shuffling etc. 

10 Included in the concept of model phytase is also any 

hybrid or chimeric phytase, i.e. a phytase which comprises a 
combination of partial amino acid sequences derived from at 
least two phytases. 

The hybrid phytase may comprise a combination of partial 

15 amino acid sequences deriving from at least two ascomycete 
phytases, at least two basidiomycete phytases or from at least 
one ascomycete and at least one basidiomycete phytase. These 
ascomycete and basidiomycete phytases from which a partial amino 
acid sequence derives may, e.g., be any of those specific 

20 phytases referred to herein. 

In the present context, a hybrid, shuffled, random 
mutagenised, site-directed mutagenised or otherwise genetically 
engineered phytase derived from ascomycete phytases only is also 
an ascomycete phytase; and a hybrid, shuffled, random 

25 mutagenised, site-directed mutagenised or otherwise genetically 
engineered phytase derived from model basidiomycete phytases 
only is also a basidiomycete phytase. Any hybrid derived from at 
least one ascomycete phytase as well as at least one 
basidiomycete phytase is called a mixed ascomycete/basidiomycete 

30 phytase and such phytase is also a model phytase in the present 
context . 
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Analogously, a hybrid, shuffled, random mutagenised, site- 
directed mutagenised or otherwise genetically engineered phytase 
derived from one or more Aspergillus phytases is also an 
Aspergillus derived phytase; and a hybrid, shuffled, random 
5 mutagenised, site-directed mutagenised or otherwise genetically 
engineered phytase derived from any other taxonomic sub-grouping 
mentioned herein is also to be designated a phytase derived from 
this taxonomic sub-grouping. 

Still further, in the present context, "derived from" is 

10 intended to indicate a phytase produced or producible by a 
strain of the organism in question, but also a phytase encoded 
by a DNA sequence isolated from such strain and produced in a 
host organism transformed with said DNA sequence. Finally, the 
term is intended to indicate a phytase which is encoded by a DNA 

15 sequence of synthetic and/or cDNA origin and which has the 
identifying characteristics of the phytase in question. 

Preferably the model phytase is a phytase which can be 
aligned as described below to either of the thirteen phytases of 
Fig. 1 (which are particularly preferred model phytases) . 

20 Preferred wild-type model phytases (i.e. neither 

recombinant, or shuffled or otherwise genetically engineered 
phytases) have a degree of similarity or homology, preferably 
identity, to amino acid sequence no. 38-403 (Peniophora numbers) 
of either of these thirteen phytases of at least 40%, more 

25 preferably at least 50%, still more preferably at least 60%, in 
particular at least 70%, especially at least 80%, and in a most 
preferred embodiment a degree of similarity of at least 90%. 

Preferred recombinant or shuffled or otherwise genetically 
engineered model phytases have a degree of similarity or 

30 homology, preferably identity, to partial sequence no. 38-4 9, 
63-77, 274-291, 281-300 and 389-403 (Peniophora numbers) of 



WO 99/49022 PCT/DK99/00153 

11 

either of these thirteen phytases of at least 60%, more 
preferably at least 70%, still more preferably at least 80%, in 
particular at least 90%. 

In a preferred embodiment the degree of similarity is 
5 based on a comparison with the complete amino acid sequence of 
either of the thirteen phytases. 

The degree of similarity or homology, alternatively 
identity, can be determined using any alignment programme known 
in the art, A preferred alignment programme is GAP provided in 

10 the GCG version 8 program package (Program Manual for the 
Wisconsin Package, Version 8, August 1994, Genetics Computer 
Group, 575 Science Drive, Madison, Wisconsin, USA 53711) (see 
also Needleman, S.B. and Wunsch, CD., (1970), Journal of 
Molecular Biology, 48, 443-453) . Using GAP with the following 

15 settings for polypeptide sequence comparison: GAP weight of 
3.000 and GAP lengthweight of 0.100. 

Also preferred is a wild-type model phytase which 
comprises an amino acid sequence encoded by a DNA sequence which 
hybridizes to a DNA sequence encoding amino acid sequence 38-403 

20 (Peniophora numbers) of any of the DNA sequences encoding the 
thirteen specific phytase sequences of Fig. 1. 

A further preferred model phytase is a genetically 
engineered phytase, which comprises an amino acid sequence 
encoded by a DNA sequence which hybridizes to a DNA sequence 

25 encoding amino acid sequence 38-49, and to a DNA sequence 
encoding amino acid sequence 63-7 7, and to a DNA sequence 
encoding amino acid sequence 274-291, and to a DNA sequence 
encoding amino acid sequence 281-300, and to a DNA sequence 
encoding amino acid sequence 389-403 (Peniophora numbers) of any 

30 of the DNA sequences encoding the thirteen specific phytase 
sequences of Fig. 1. 
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In a preferred embodiment the hybridization is to the 
complete phytase encoding part of any of the thirteen phytases. 

Suitable experimental conditions for determining whether a 
given DNA or RNA sequence "hybridizes" to a specified nucleotide 
5 or oligonucleotide probe involves presoaking of the filter 
containing the DNA fragments or RNA to examine for hybridization 
in 5 x SSC (Sodium chloride/Sodium citrate), (J. Sambrook, E. F. 
Fritsch, and T. Maniatis, 1989, Molecular Cloning, A Laboratory 
Manual, 2d edition, Cold Spring Harbor, New York) for 10 min, 

10 and prehybridization of the filter in a solution of 5 x SSC, 5 x 
Denhardt's solution (Sambrook et al. 1989), 0.5 % SDS and 100 
Ug/ml of denatured sonicated salmon sperm DNA (Sambrook et al. 
1989), followed by hybridization in the same solution containing 
a concentration of 10 ng/ml of a random-primed (Feinberg, A. P. 

15 and Vogelstein, B. (1983) Anal. Biochem. 132:6-13), 32P-dCTP- 
labeled (specific activity > 1 x 109 cpm/jig) probe for 12 hours 
at approximately 4 5°C. 

The filter is then washed twice for 30 minutes in 2 x SSC, 
0.5 % SDS at at least 55°C (low stringency), at at least 60°C 

20 (medium stringency), at at least 65 °C (medium/high stringency), 
at at least 70°C (high stringency), or at at least 75°C (very 
high stringency) . 

Molecules to which the oligonucleotide probe hybridizes 
under these conditions are detected using an x-ray film. 

25 It should be noted that a certain specific phytase variant 

need not actually have been prepared from a specific model 
phytase, for this model phytase to qualify as a "model phytase" 
in the present context. It is sufficient that the variant 
exhibits at least one of the herein indicated amendments when it 

30 is afterwards compared with the model phytase. 
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The alignment of Fig. 1 is made using the program PileUp 
(Program Manual for the Wisconsin Package, Version 8, August 
1994, Genetics Computer Group, 575 Science Drive, Madison, 
Wisconsin, USA 53711), with a GapWeight of 3.000 and a 
5 GapLengthWeight of 0.100. When aligning a new model phytase or a 
new phytase variant all thirteen sequences can be included 
together with the new phytase (variant) in a multiple alignment, 
or, alternatively, at least one of the thirteen sequences of 
Fig. 1 is included together with the new phytase (variant) in an 
10 alignment. 

A preferred procedure for aligning according to Fig. 1 a 
new model phytase (or a phytase variant) is as follows: The new 
model phytase is aligned with that specific sequence of the 
thirteen sequences of Fig. 1 to which the new model phytase has 

15 the highest degree of homology. For calculating the degree of 
homology, and for making the "alignment according to Fig. 1" of 
the two sequences, the program GAP referred to below is 
preferably used. Having aligned the two sequences, the new model 
phytase (or phytase variant) is added (pasted) to the alignment 

20 at Fig. 1 using the result of the first alignment (placing 
identical and homologous amino acid residues above each other as 
prescribed by the alignment) , following which corresponding 
positions are now easily identifiable. 

Example 7 shows an example of how to add a new model 

25 phytase to the alignment of Fig. 1 and deduce corresponding 
phytase variants thereof. 

Other model phytases can be aligned and variants deduced 
in analogy with Example 7. This is so in particular for the 
following model phytases: The phytase of Aspergillus niger var. 

30 awamori (US patent no. 5,830,733); the Bacillus phytase of 
WO 98/06858; the soy bean phytase of WO 98/20139; the maize 
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phytase of WO 98/05785; the Aspergillus phytase of WO 97/38096; 
the phytases of Monascus anka of WO 98/13480; the phytase from 
Schwanniomyces occidentalis of EP 0699762 etc. 

When comparing a model phytase and a proposed phytase 
5 variant using the alignment as described herein, corresponding 
amino acid positions can be identified, viz. a model position of 
the model phytase and a variant position of the variant - the 
corresponding model position and variant position are simply 
placed one above the other in the alignment. An amendment is 

10 said to have occurred in a given position if the model amino 
acid of the model position and the variant amino acid of the 
variant position are different. Preferred amendments of these 
positions manifest themselves as amino acid substitutions, 
deletions or additions. 

15 Amended in at least one position means amended in one or 

more positions, i.e. in one, two, three, four, five, six, seven, 
eight, nine, ten, eleven, twelve etc. up to all N positions 
listed. This definition includes any possible sub-combinations 
thereof, e.g. any set of two substitutions, any set of three, 

20 any set of four, etc. - to any set of (N-l) positions. 

In the present context all sequences, whatever the model 
phytase, and including the thirteen sequences of Fig. 1, are 
numbered using the numbering corresponding to the phytase 
P__lycii. These "Peniophora numbers" are indicated at Fig. 1, 

25 together with the "alignment numbers. " The numbering of P_lycii 
starts at Ml and ends at E439. 

As explained above, the alignment reveals which positions 
in various phytase sequences other than P_lycii are equivalent 
or corresponding to the given P. lycii position. 

30 A substitution of amino acids is indicated herein as for 

instance "3S," which indicates, that at position 3 amino acid S 
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should be substituted for the "original" or model position 3 
amino acid, whichever it is. Thus, the substitution should 
result in an S in the corresponding variant position. 
Considering now the alignment at Fig. 1, a substitution like 
5 e.g. "3S" is to be interpreted as follows, for the respective 
phytases shown (the amino acid first indicated is the "original" 
or model amino acid in "Peniophora position " 3) : 



10 



15 



20 



P__involtus_Al : 
P_involtus_A2 : 
T_pubescens : 
A_pediades : 
P_lycii : 
A__f umigatus : 
consphyA: 
A_nidulans : 
A_f icuum_NRRL3135 
A__terreus : 
T_thermo : 
T_lanuginosa : 
M_thermophila : 



F3S (number 3 F substituted by S) 

L3S 

MIS 

MIS 

redundant (already an S) 

T5S 

V5S 

T5S 

ASS 

ASS 

L5S 

V11S 

GSS 



However, in what follows the above specific substitutions 
will be designated as follows (always using the Peniophora 
numbering) : 
25 P involtus Al: F3S 



30 



P_involtus_A2 : 
T_pubescens : 
A_pediades : 
P_lycii : 
A_f umigatus : 



L3S 
M3S 
M3S 

redundant (already an S) 
T3S 



consphyA: 



V3S 
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maujLcins . 


m T n 

I oS 


A_ 


_f icuum_NRRL3135 : 


A3S 


A_ 


terreus : 


A3S 


T_ 


thermo : 


L3S 


T_ 


lanuginosa : 


V3S 


M_ 


thermophila : 


G3S 



Still further, denotations like e.g. XX 3S,F,G" means that 
the amino acid in position 3 (Peniophora numbers) of the model 
10 phytase in question is substituted with either of S, F or G, 
i.e. e.g. the designation "3S,F,G" is considered fully 
equivalent to the designation "3S, 3F, 3G" . 

A denotation like ()3S means that amino acid S is added to 
the sequence in question (at a gap in the actual sequence), in a 
15 position corresponding to Peniophora number 3 - and vice versa 
for deletions (S3()). 

In case of regions in which the Peniophora phytase 
sequence has larger deletions than some of the other phytases in 
Fig. 1, for instance in the region between position 201 and 202 
20 (Peniophora numbers) , intermediate positions (amino acid 
residues in other sequences) are numbered by adding a,b,c,d, 
etc, in lower-case letters, to the last Peniophora position 
number, e.g. for the phytase M_thermophila: E201; G201a; P201b; 
Y201c; S201d; T201e; I201f; G202; D203 etc. 
• 5 In one of the priority applications of the present 

application there are two minor position numbering errors: 
According to the above definitions, the positions referred to in 
* first priority application as 204 and 205 (Peniophora 
^rs) are wrongly designated; they should have been numbered 
nd 204, respectively. Therefore, 204 has been substituted 
and 205 by 204 throughout the present application. 
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A preferred phytase variant of the invention comprises an 
amino acid sequence which comprises, preferably contains, one or 
more of the following amino acid substitutions: 

24C; 27P; 31Y; 33C; 39H,S,Q; 40L,N; 42S,G; 

5 43A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y; 44N; 45D,S; 47Y, F; 

49P; 51E,A,R; 56P; 58D,K,A; 59G; 61R; 62V, I; 69Q; 75W, F; 78D, S; 

79G; 80K,A; 81A,G,Q,E; 82T; 83A,I,K,R,Q; 84I,Y,Q,V; 881; 90R,A; 

102Y; 115N; 116S; 118V, L; 119E; 120L; 122A; 123N,Q,T; 125M,S; 

126H,S,V; 127Q,E,N; 128A,S,T; 132F, I,L; 143N; 148V, I; 151A,S; 
10 152G; 153D,Y; 154D,Q,S,G; 157V; 158D,A; 159T; 160A,S; 161T,N; 

162N; 163W; 170fH; 170gA; 171N; 172P; 173Q,S; 184Q,S,P; 185S; 

186A,E,P; 187A; 187aS; 190A, P; 193S; 194S,T; 195T,V,L; 198A,N,V; 

200G,V; 201D,E; a deletion of at least one of 201a, 201b, 201c, 

201d, 201e, 201f, preferably all; 201eT; 202S,A; 203R,K,S; 
15 203aV,T; 204Q, E, S, A, V; 205E; 211L,V; 215A, P; 220L,N; 223H,D; 

228N; 232T; 233E; 235Y,L,T; 236Y,N; 237F; 238L,M; 242P,S; 244D; 

246V; 251eE,Q; 253P; 256D; 260A,H; 264R,I; 265A,Q; 267D; 

270Y,A,L,G; 271D,N; 273D,K; 275F,Y; 278T,H; 280A, P; 283P; 

287A,T; 288L,I,F; 292F,Y; 293A,V; 302R,H; 304P,A; 332F; 336S; 
20 337T,G,Q,S; 3381; 339V, I; 340P,A; 343A, S, F, I, L; 348Y; 349P; 

352K; 360R; 362P; 364W,F; 365V, L, A, S; 366D,S,V; 367A / K; 368K; 

3691, L; 370V; 373A, S; 374S,A; 375H; 376M; 383kQ,E; 387P; 393V; 

396R; 404A,G; 409R; 411K,T; 412R; 417E,R; 421F,Y; 431E. 

In a preferred embodiment this is with the proviso that 
25 the model phytase does not already comprise the above suggested 

amino acid substitution or addition or deletion at the position 

indicated. Or, with the proviso that, for each position, the 

model amino acid is not already the variant amino acid hereby 

proposed. But these provisos can be said to be in fact already 
30 inherent in the above wording, because of the expression 

"amended. " 
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The various preferred phytase variants of claims 16-34 
comprises, preferably contains or have, amino acid sequences 
which comprise or contain one or more of the amino acid 
substitutions, additions, or deletions listed in the respective 
5 claims . 

In a preferred embodiment the various phytase variants 
comprise 1, 2, 3, 4, 5, 6, 7, 8, 9 or even 10 of these 
substitutions; or a number of substitutions of 10-15, 15-20, 20- 
30 or even 30-50; eventually up to 60, 70, 80 or 90 
10 substitutions. 

In another preferred embodiment, the amino acid sequence 
of the various phytase variants comprise one or more 
substitutions of the substitution sub-groupings listed 
hereinbelow; or combinations of substitutions classified in two 

15 or more sub-groupings . 

Generally, instead of "comprise," "contain" or "have," the 
amino acid sequences of preferred variants "consist essentially 
of" or "consist of" the specific model phytases of fig. 1, as 
modified by one or more of the substitutions described herein. 

20 In the present context a basidiomycete means a 

microorganism of the phylum Basidiomycota . This phylum of 
Basidiomycota is comprised in the fungal kingdom together with 
e.g. the phylum Ascomycota ("ascomycetes") . 

Taxonomical questions can be clarified by consulting the 

25 references listed below or by consulting a fungal taxonomy 
database (NIH Data Base (Entrez) ) which is available via the 
Internet on World Wide Web at the following address: 
http : / /www3 . ncbi . nlm. nih . gov/Taxonomy /tax . html . 

For a definition of basidiomycetes, reference is made to 

30 either Jiilich, 1981, Higher Taxa of Basidiomycetes; Ainsworth & 
Bisby's (eds.) Dictionary of the Fungi, 1995, Hawksworth, D.L., 
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P.M. Kirk, B.C. Sutton & D.N. Pegler; or Hansen & Knudsen 
(Eds.), Nordic Macromycetes, vol. 2 (1992) and 3 (1997). A 
preferred reference is Hansen & Knudsen. 

For a definition of ascomycetes, reference is made to 
5 either of Ainsworth & Brisby cited above or Systema Ascomycetum 
by Eriksson, O.E. & D. L. Hawksworth, Vol. 16, 1998. A preferred 
reference is Eriksson et al. 

Generally, a microorganism which is classified as a 
basidiomycete/ascomycete in either of the references listed 
10 above, including the database, is a basidiomycete/ascomycete in 
the present context. 

Some Aspergillus strains are difficult to classify because 
they are anamorphous, and therefore they might be classified in 
Fungi Imperfecti. However, once the teleomorphous counterpart is 
15 found, it is re-classified taxonomically . For instance, the 
teleomorph of A. nidulans is Emericella nidulans (of the family 
Trichocomaceae, the order Eurotiales, the class Plectomycetes of 
the phylum Ascomycota) . These subgroupings of Ascomycota are 
preferred, together with the family Lasiosphaeriaceae, the order 
20 Sordariales, the class Pyrenomycetes of the phylum Ascomycota. 

The wording "ascomycetes" and analogues as used herein 
includes any strains of Aspergillus, Thermomyces, 
Myceliophthora, Talaromyces which are anamorphous and thus would 
be classified in Fungi Imperfecti. 
25 Preferred basidiomycete phytases are those listed in 

WO 98/28409, in the very beginning of the section headed 
"Detailed description of the invention". 

DNA sequences encoding the thirteen specifically listed 
model phytases and other model phytases can be prepared 
30 according to the teachings of each of the documents listed under 
the brief description of the drawings. 
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A DNA sequence encoding a model phytase may be isolated 
from any cell or microorganism producing the phytase in 
question, using various methods well known in the art. First, a 
genomic DNA and/or cDNA library should be constructed using 
5 chromosomal DNA or messenger RNA from the organism that produces 
the phytase. Then, if the amino acid sequence of the phytase is 
known, homologous, labelled oligonucleotide probes may be 
synthesized and used to identify phytase-encoding clones from a 
genomic library prepared from the organism in question. 
10 Alternatively, a labelled oligonucleotide probe containing 
sequences homologous to a known phytase gene could be used as a 
probe to identify phytase-encoding clones, using hybridization 
and washing conditions of lower stringency. 

Yet another method for identifying phytaseencoding clones 
15 would involve inserting fragments of genomic DNA into an 
expression vector, such as a plasmid, transforming phytase- 
negative bacteria with the resulting genomic DNA library, and 
then plating the transformed bacteria onto agar containing a 
substrate for phytase thereby allowing clones expressing the 
20 phytase to be identified. 

Alternatively, the DNA sequence encoding the enzyme may be 
prepared synthetically by established standard methods, e.g. the 
phosphoroamidite method described by S.L. Beaucage and M.H. 
Caruthers (1981) or the method described by Matthes et al. 
25 (1984). m the phosphoroamidite method, oligonucleotides are 
synthesized, e.g. in an automatic DNA synthesizer, purified, 
annealed, ligated and cloned in appropriate vectors. 

Finally, the DNA sequence may be of mixed genomic and syn- 
thetic origin, mixed synthetic and cDNA origin or mixed genomic 
30 and cDNA origin, prepared by ligating fragments of synthetic, 
genomic or cDNA origin (as appropriate, the fragments 
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corresponding to various parts of the entire DNA sequence) , in 
accordance with standard techniques. The DNA sequence may also 
be prepared by polymerase chain reaction (PCR) using specific 
primers, for instance as described in US 4,683,202 or R.K. Saiki 
5 et al. (1988) . 

DNA encoding the phytase variants of the present invention 
can be prepared by methods known in the art, such as Site- 
directed Mutagenesis. Once a DNA sequence encoding a model 
phytase of interest has been isolated, and desirable sites for 

10 mutation identified, mutations may be introduced using synthetic 
oligonucleotides. These oligonucleotides contain nucleotide 
sequences flanking the desired mutation sites; mutant 
nucleotides are inserted during oligonucleotide synthesis. In a 
specific method, a single-stranded gap of DNA, bridging the 

15 phytase-encoding sequence, is created in a vector carrying the 
phytase-encoding gene. Then the synthetic nucleotide, bearing 
the desired mutation, is annealed to a homologous portion of the 
single-stranded DNA. The remaining gap is then filled in with 
DNA polymerase I (Klenow fragment) and the construct is ligated 

20 using T4 ligase. A specific example of this method is described 
in Morinaga et al. (1984). US 4,760,025 discloses the 

introduction of oligonucleotides encoding multiple mutations by 
performing minor alterations of the cassette. However, an even 
greater variety of mutations can be introduced at any one time 

25 by the Morinaga method because a multitude of oligonucleotides, 
of various lengths, can be introduced. 

Another method of introducing mutations into DNA sequences 
encoding a desired model phytase is described in Nelson and Long 
(1989) . It involves a 3-step generation of a PCR fragment 

30 containing the desired mutation introduced by using a chemically 
synthesized DNA strand as one of the primers in the PCR 
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reactions- From the PCR-generated fragment, a DNA fragment 
carrying the mutation may be isolated by cleavage with 
restriction endonucleases and reinserted into an expression 
plasmid. 

5 Yet another method of mutating DNA sequences encoding a 

model phytase is Random Mutagenesis. Random mutagenesis is 
suitably performed either as localised or region-specific random 
mutagenesis in at least three parts of the gene translating to 
the amino acid sequence shown in question, or within the whole 
10 gene. 

The random mutagenesis of a DNA sequence encoding a model 
phytase may be conveniently performed by use of any method known 
in the art. 

In relation to the above, further aspects of the present 
15 invention relates to a method for generating a variant of a 
model phytase, wherein the variant preferably exhibits amended 
characteristics as described below, the method comprising: 

(a) subjecting a DNA sequence encoding the model phytase 
to Site-directed Mutagenesis, or the Nelson and Long PCR 

20 mutagenesis method or to Random Mutagenesis, 

(b) expressing the mutated DNA sequence obtained in step 
(a) in a host cell, and 

(c) screening for host cells expressing a phytase 
variant which has an altered property relative to the model 

25 phytase. 

When using Random Mutagenesis, step (a) of the above 
method of the invention is preferably performed using doped 
primers . 

For instance, the random mutagenesis may be performed by 
30 use of a suitable physical or chemical mutagenizing agent, by 
use of a suitable oligonucleotide, or by subjecting the DNA 
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sequence to PCR generated mutagenesis. Furthermore, the random 
mutagenesis may be performed by use of any combination of these 
mutagenizing agents. The mutagenizing agent may, e.g., be one 
which induces transitions, transversions, inversions, 
5 scrambling, deletions, and/or insertions. 

Examples of a physical or chemical mutagenizing agent 
suitable for the present purpose include ultraviolet (UV) 
irradiation, hydroxy lamine, N-methyl-N 9 -nitro-N-nitrosoguanidine 
(MNNG) , 0-methyl hydroxy lamine, nitrous acid, ethyl methane 

10 sulphonate (EMS), sodium bisulphite, formic acid, and nucleotide 
analogues. When such agents are used, the mutagenesis is 
typically performed by incubating the DNA sequence encoding the 
parent enzyme to be mutagenized in the presence of the 
mutagenizing agent of choice under suitable conditions for the 

15 mutagenesis to take place, and selecting for mutated DNA having 
the desired properties. 

When the mutagenesis is performed by the use of an 
oligonucleotide, the oligonucleotide may be doped or spiked with 
the three non-parent nucleotides during the synthesis of the 

20 oligonucleotide at the positions which are to be changed. The 
doping or spiking may be done so that codons for unwanted amino 
acids are avoided. The doped or spiked oligonucleotide can be 
incorporated into the DNA encoding the phytase enzyme by any 
published technique, using e.g. PCR, LCR or any DNA polymerase 

25 and ligase as deemed appropriate. 

Preferably, the doping is carried out using "constant 
random doping", in which the percentage of wild-type and 
mutation in each position is predefined. Furthermore, the 
doping may be directed toward a preference for the introduction 

30 of certain nucleotides, and thereby a preference for the 
introduction of one or more specific amino acid residues. The 
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doping may be made, e.g., so as to allow for the introduction 
of 90% wild type and 10% mutations in each position. An 
additional consideration in the choice of a doping scheme is 
based on genetic as well as protein-structural constraints. The 
5 doping scheme may be made by using the DOPE program which, inter 
alia, ensures that introduction of stop codons is avoided. 

When PCR-generated mutagenesis is used, either a 
chemically treated or non-treated gene encoding a model phytase 
is subjected to PCR under conditions that increase the mis- 

10 incorporation of nucleotides (Deshler 1992; Leung et al., 
Technique, Vol.1, 1989, pp. 11-15). 

A mutator strain of E. coli (Fowler et al., Molec. Gen. 
Genet., 133, 1974, pp. 179-191), S. cereviseae or any other 
microbial organism may be used for the random mutagenesis of the 

15 DNA encoding the model phytase by, e.g., transforming a plasmid 
containing the parent glycosylase into the mutator strain, 
growing the mutator strain with the plasmid and isolating the 
mutated plasmid from the mutator strain. The mutated plasmid 
may be subsequently transformed into the expression organism. 

20 The DNA sequence to be mutagenized may be conveniently 

present in a genomic or cDNA library prepared from an organism 
expressing the model phytase. Alternatively, the DNA sequence 
may be present on a suitable vector such as a plasmid or a 
bacteriophage, which as such may be incubated with or otherwise 

25 exposed to the mutagenising agent. The DNA to be mutagenized 
may also be present in a host cell either by being integrated in 
the genome of said cell or by being present on a vector 
harboured in the cell. Finally, the DNA to be mutagenized may 
be in isolated form. It will be understood that the DNA 

30 sequence to be subjected to random mutagenesis is pre-ferably a 
cDNA or a genomic DNA sequence. 
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In some cases it may be convenient to amplify the mutated 
DNA sequence prior to performing the expression step b) or the 
screening step c) . Such amplification may be performed in 
accordance with methods known in the art, the presently 
5 preferred method being PCR-generated amplification using 
oligonucleotide primers prepared on the basis of the DNA or 
amino acid sequence of the parent enzyme. 

Subsequent to the incubation with or exposure to the 
mutagenising agent, the mutated DNA is expressed by culturing a 

10 suitable host cell carrying the DNA sequence under conditions 
allowing expression to take place. The host cell used for this 
purpose may be one which has been transformed with the mutated 
DNA sequence, optionally present on a vector, or one which was 
carried the DNA sequence encoding the parent enzyme during the 

15 mutagenesis treatment. Examples of suitable host cells are the 
following: gram positive bacteria such as Bacillus subtilis, 
Bacillus lichenif ormis, Bacillus lentus, Bacillus brevis, 
Bacillus stearothermophilus, Bacillus alkalophilus, Bacillus 
amyloliquef aciens, Bacillus coagulans, Bacillus circulans, 

20 Bacillus lautus, Bacillus megaterium, Bacillus thuringiensis, 
Streptomyces lividans or Streptomyces murinus; and gram- 
negative bacteria such as E. coli. 

The mutated DNA sequence may further comprise a DNA 
sequence encoding functions permitting expression of the mutated 

25 DNA sequence. 

The random mutagenesis may be advantageously localised to 
a part of the model phytase in question using Localized random 
mutagenesis. This may, e.g., be advantageous when certain 
regions of the enzyme have been identified to be of particular 
30 importance for a given property of the enzyme, and when modified 
are expected to result in a variant having improved properties. 
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Such regions may normally be identified when the tertiary 
structure of the parent enzyme has been elucidated and related 
to the function of the enzyme. 

The localized, or region-specif ic, random mutagenesis is 
5 conveniently performed by use of PCR generated mutagenesis 
techniques as described above or any other suitable technique 
known in the art. Alternatively , the DNA sequence encoding the 
part of the DNA sequence to be modified may be isolated, e.g., 
by insertion into a suitable vector, and said part may be 

10 subsequently subjected to mutagenesis by use of any of the 
mutagenesis methods discussed above. 

For region-specific random mutagenesis with a view to 
amending e.g. the specific activity of a model phytase, codon 
positions corresponding to the following amino acid residues 

15 from the amino acid sequences set forth in Fig. 1 may 
appropriately be targeted: 

Residues: 41-47, 68-80, 83-84, 115-118, 120-126, 128, 
149-163, 184-185, 191-193, 198-201e, 202-203, 205, 235-236, 238- 
239, 242-243, 270-279, 285, 288, 332-343, 364-367, 369-375, 394. 

20 Regions: 41-47, 68-80, 120-128, 149-163, 270-279, 332-343, 

364-375. 

The random mutagenesis may be carried out by the following 

steps : 

1. Select regions of interest for modification in the 
25 parent enzyme 

2. Decide on mutation sites and non-mutated sites in 
the selected region 

3. Decide on which kind of mutations should be carried 
out, e.g. with respect to the desired stability and/or 

30 performance of the variant to be constructed 

4. Select structurally reasonable mutations 
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5. Adjust the residues selected by step 3 with regard 
to step 4 . 

6. Analyse by use of a suitable dope algorithm the 
nucleotide distribution . 

5 7. If necessary, adjust the wanted residues to genetic 

code realism, e.g. taking into account constraints resulting 
from the genetic code, e.g. in order to avoid introduction of 
stop codons; the skilled person will be aware that some codon 
combinations cannot be used in practice and will need to be 
10 adapted 

8. Make primers 

9. Perform random mutagenesis by use of the primers 

10. Select resulting phytase variants by screening for 
the desired improved properties. 

15 Suitable dope algorithms for use in step 6 are well known 

in the art. One such algorithm is described by Tomandl, D. et 
al., 1997, Journal of Computer-Aided Molecular Design 11:29-38. 
Another algorithm is DOPE (Jensen, LJ, Andersen, KV, Svendsen, 
A, and Kretzschmar, T (1998) Nucleic Acids Research 26:697-702). 

20 A DNA sequence encoding a model phytase or a phytase 

variant of the invention can be expressed using an expression 
vector, a recombinant expression vector, which typically 
includes control sequences encoding a promoter, operator, 
ribosome binding site, translation initiation signal, and, 

25 optionally, a repressor gene or various activator genes. 

The recombinant expression vector may be any vector which 
may conveniently be subjected to recombinant DNA procedures, and 
the choice of vector will often depend on the host cell into 
which it is to be introduced. Thus, the vector may be an 

30 autonomously replicating vector, e.g. a plasmid, a bacteriophage 
or an extra-chromosomal element. Alternatively, the vector may 
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be one which, when introduced into a host cell, is integrated 
into the host cell genome and replicated together with the 
chromosome (s) into which it has been integrated. 

In the vector, the DNA sequence should be operably 
5 connected to a suitable promoter sequence. The promoter may be 
any DNA sequence which shows transcriptional activity in the 
host cell of choice and may be derived from genes encoding 
proteins either homologous or heterologous to the host cell. An 
example of a suitable promoter for directing the transcription 

10 of the DNA sequence encoding a phytase variant of the invention, 
especially in a bacterial host, is the promoter of the lac 
operon of E.coli. For transcription in a fungal host, examples 
of useful promoters are those derived from the gene encoding A. 
oryzae TAKA amylase. 

15 The expression vector of the invention may also comprise a 

■ 

suitable transcription terminator and, in eukaryotes, 
polyadenylation sequences operably connected to the DNA sequence 
encoding the phytase variant of the invention. Termination and 
polyadenylation sequences may suitably be derived from the same 

20 sources as the promoter. 

The vector may further comprise a DNA sequence enabling 
the vector to replicate in the host cell in question. Examples 
of such sequences are the origins of replication of plasmids 
pUC19, pACYC177, pUBHO, pE194, pAMBl and pIJ702. 

25 The vector may also comprise a selectable marker, e.g. a 

gene the product of which complements a defect in the host cell, 
such as the dal genes from B. subtilis or B. lichenif ormis, or 
one which confers antibiotic resistance such as ampicillin 
resistance. Furthermore, the vector may comprise Aspergillus 

30 selection markers such as amdS, argB, niaD and sC, or the 
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selection may be accomplished by co-transformation, e.g. as 
described in WO 91/17243. 

The procedures used to ligate the DNA construct of the 
invention encoding a phytase variant, the promoter, terminator 
5 and other elements, respectively, and to insert them into 
suitable vectors containing the information necessary for 
replication, are well known to persons skilled in the art (cf., 
for instance, Sambrook et al . (1989)). 

The cell of the invention, either comprising a DNA 

10 construct or an expression vector of the invention as defined 
above, is advantageously used as a host cell in the recombinant 
production of a phytase variant of the invention. The cell may 
be transformed with the DNA construct of the invention encoding 
the variant, conveniently by integrating the DNA construct (in 

15 one or more copies) in the host chromosome. This integration is 
generally considered to be an advantage as the DNA sequence is 
more likely to be stably maintained in the cell. Integration of 
the DNA constructs into the host chromosome may be performed 
according to conventional methods, e.g. by homologous or 

20 heterologous recombination. Alternatively, the cell may be 
transformed with an expression vector as described above in 
connection with the different types of host cells. 

An isolated DNA molecule or, alternatively, a "cloned DNA 
sequence" "a DNA construct, " "a DNA segment" or "an isolated DNA 

25 sequence" refers to a DNA molecule or sequence which can be 
cloned in accordance with standard cloning procedures used in 
genetic engineering to relocate the DNA segment from its natural 
location to a different site where it will be replicated. The 
term refers generally to a nucleic acid sequence which is 

30 essentially free of other nucleic acid sequences, e.g., at least 
about 20% pure, preferably at least about 40% pure, more 
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preferably about 60% pure, even more preferably about 80% pure, 
most preferably about 90% pure, and even most preferably about 
95% pure, as determined by agarose gel electrophoresis. The 
cloning procedures may involve excision and isolation of a 
5 desired nucleic acid fragment comprising the nucleic acid 
sequence encoding the polypeptide, insertion of the fragment 
into a vector molecule, and incorporation of the recombinant 
vector into a host cell where multiple copies or clones of the 
nucleic acid sequence will be replicated. The nucleic acid 

10 sequence may be of genomic, cDNA, RNA, semisynthetic, synthetic 
origin, or any combinations thereof. 

The term "vector" is intended to include such 
terms/objects as "nucleic acid constructs," "DNA constructs," 
expression vectors" or "recombinant vectors." 

15 The nucleic acid construct comprises a nucleic acid 

sequence of the present invention operably linked to one or more 
control sequences capable of directing the expression of the 
coding sequence in a suitable host cell under conditions 
compatible with the control sequences. 

20 "Nucleic acid construct" is defined herein as a nucleic 

acid molecule, either single or double-stranded, which is 
isolated from a naturally occurring gene or which has been 
modified to contain segments of nucleic acid which are combined 
and juxtaposed in a manner which would not otherwise exist in 

25 nature. 

The term nucleic acid construct may be synonymous with the 
term expression cassette when the nucleic acid construct 
contains all the control sequences required for expression of a 
coding sequence of the present invention. 
30 The term "coding sequence" as defined herein primarily 

comprises a sequence which is transcribed into mRNA and 
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translated into a polypeptide of the present invention when 
placed under the control of the above mentioned control 
sequences. The boundaries of the coding sequence are generally 
determined by a translation start codon ATG at the 5' -terminus 
5 and a translation stop codon at the 3' -terminus. A coding 
sequence can include, but is not limited to, DNA, cDNA, and 
recombinant nucleic acid sequences. 

The term "control sequences" is defined herein to include 
all components which are necessary or advantageous for 

10 expression of the coding sequence of the nucleic acid sequence. 
Each control sequence may be native or foreign to the nucleic 
acid sequence encoding the polypeptide. Such control sequences 
include, but are not limited to, a leader, a polyadenylation 
sequence, a propeptide sequence, a promoter, a signal sequence, 

15 and a transcription terminator. At a minimum, the control 
sequences include a promoter, and transcriptional and 
translational stop signals. The control sequences may be 
provided with linkers for the purpose of introducing specific 
restriction sites facilitating ligation of the control sequences 

20 with the coding region of the nucleic acid sequence encoding a 
polypeptide . 

A "host cell" or "recombinant host cell" encompasses any 
progeny of a parent cell which is not identical to the parent 
cell due to mutations that occur during replication. 

25 The cell is preferably transformed with a vector 

comprising a nucleic acid sequence of the invention followed by 
integration of the vector into the host chromosome. 

"Transformation" means introducing a vector comprising a 
nucleic acid sequence of the present invention into a host cell 

30 so that the vector is maintained as a chromosomal integrant or 
as a self -replicating extra-chromosomal vector. Integration is 
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generally considered to be an advantage as the nucleic acid 
sequence is more likely to be stably maintained in the cell. 
Integration of the vector into the host chromosome may occur by 
homologous or non-homologous recombination as described above. 
5 The host cell may be a unicellular microorganism, e.g., a 

prokaryote, or a non-unicellular microorganism, e.g., a 
eukaryote. Examples of a eukaryote cell is a mammalian cell, an 
insect cell, a plant cell or a fungal cell. Useful mammalian 
cells include Chinese hamster ovary (CHO) cells, HeLa cells, 

10 baby hamster kidney (BHK) cells, COS cells, or any number of 
other immortalized cell lines available, e.g., from the American 
Type Culture Collection. 

In a preferred embodiment, the host cell is a fungal cell. 
Fungal cells may be transformed by a process involving 

15 protoplast formation, transformation of the protoplasts, and 
regeneration of the cell wall in a manner known per se. 

The present invention also relates to a transgenic plant, 
plant part, such as a plant seed, or plant cell, which has been 
transformed with a DNA sequence encoding the phytase of the 

20 invention so as to express or produce this enzyme. Also 
compositions and uses of such plant or plant part are within the 
scope of the invention, especially its use as feed and food or 
additives therefore, along the lines of the present use and 
food/feed claims. 

25 The transgenic plant can be dicotyledonous or 

monocotyledonous, for short a dicot or a monocot. Of primary 
interest are such plants which are potential food or feed 
components and which comprise phytic acid. A normal phytic acid 
level of feed components is 0.1-100 g/kg, or more usually 0.5-50 

30 g/kg, most usually 0.5-20 g/kg. Examples of monocot plants are 
grasses, such as meadow grass (blue grass, Poa) , forage grass 
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such as festuca, lolium, temperate grass, such as Agrostis, and 
cereals, e.g. wheat, oats, rye, barley, rice, sorghum and maize 
(corn) . 

Examples of dicot plants are legumes, such as lupins, pea, 
5 bean and soybean, and cruciferous (family Brassicaceae) , such as 
cauliflower, oil seed rape and the closely related model 
organism Arabidopsis thaliana. 

Such transgenic plant etc. is capable of degrading its own 
phytic acid, and accordingly the need for adding such enzymes to 
10 food or feed comprising such plants is alleviated. Preferably, 
the plant or plant part, e.g. the seeds, are ground or milled, 
and possibly also soaked before being added to the food or feed 
or before the use, e.g. intake, thereof, with a view to adapting 
the speed of the enzymatic degradation to the actual use. 
15 If desired, the plant produced enzyme can also be 

recovered from the plant. In certain cases the recovery from the 
plant is to be preferred with a view to securing a heat stable 
formulation in a potential subsequent pelleting process. 

Examples of plant parts are stem, callus, leaves, root, 
20 fruits, seeds, tubers etc. But also any plant tissue is included 
in this definition. 

Any plant cell, whatever the tissue origin, is included in 
the definition of plant cells above. 

Also included within the scope of the invention are the 
25 progeny of such plants, plant parts and plant cells. 

The skilled man will know how to construct a DNA 
expression construct for insertion into the plant in question, 
paying regard i.a. to whether the enzyme should be excreted in a 
tissue specific way. Of relevance for this evaluation is the 
30 stability (pH-stability, degradability by endogenous proteases 
etc.) of the phytase in the expression compartments of the 
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plant. He will also be able to select appropriate regulatory 
sequences such as promoter and terminator sequences, and signal 
or transit sequences if required (Tague et al, Plant, Phys . , 86, 
506, 1988). 

5 The plant, plant part etc. can be transformed with this 

DNA construct using any known method. An example of such method 
is the transformation by a viral or bacterial vector such as 
bacterial species of the genus Agrobacterium genetically 
engineered to comprise the gene encoding the phytase of the 

10 invention. Also methods of directly introducing the phytase DNA 
into the plant cell or plant tissue are known in the art, e.g. 
micro injection and electroporation (Gasser et al, Science, 244, 
1293; Potrykus, Bio/Techn. 8, 535, 1990; Shimamoto et al, 
Nature, 338, 274, 1989). 

15 Following the transformation, the transf ormants are 

screened using any method known to the skilled man, following 
which they are regenerated into whole plants. 

These plants etc. as well as their progeny then carry the 
phytase encoding DNA as a part of their genetic equipment. 

20 In general, reference is made to WO 9114782A and WO 

9114772A. 

Agrobacterium tumefaciens mediated gene transfer is the 
method of choice for generating transgenic dicots (for review 
Hooykas & Schilperoort , 1992. Plant Mol. Biol. 19: 15-38), 

25 however it can also be used for transforming monocots. Due to 
host range limitations it is generally not possible to transform 
monocots with the help of A. tumefaciens. Here, other methods 
have to be employed. The method of choice for generating 
transgenic monocots is particle bombardment (microscopic gold or 

30 tungsten particles coated with the transforming DNA) of 
embryonic calli or developing embryos (Christou, 1992. Plant J. 
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2: 275-281; Shimamoto, 1994. Curr. Opin. Biotechnol. 5: 158-162; 
Vasil et al., 1992. Bio/Technology 10: 667-674). 

Also other systems for the delivery of free DNA into these 
plants, including viral vectors (Joshi & Joshi, 1991. FEBS Lett. 
5 281: 1-8), protoplast transformation via polyethylene glycol or 
electroporation (for review see Potyrkus, 1991. Annu. Rev. Plant 
Physiol. Plant Mol. Biol. 42: 205-225), microinjection of DNA 
into mesophyll protoplasts (Crossway et al., 1986. Mol. Gen. 
Genet. 202: 79-85), and macroinjection of DNA into young floral 

10 tillers of cereal plants (de la Pena et al., 1987. Nature 325: 
274-276) are preferred methods. 

In general, the cDNA or gene encoding the phytase variant 
of the invention is placed in an expression cassette (e.g. 
Pietrzak et al., 1986. Nucleic Acids Res. 14: 5857-5868) 

15 consisting of a suitable promoter active in the target plant and 
a suitable terminator (termination of transcription) . This 
cassette (of course including a suitable selection marker, see 
below) will be transformed into the plant as such in case of 
monocots via particle bombardment. In case of dicots the 

20 expression cassette is placed first into a suitable vector 
providing the T-DNA borders and a suitable selection marker 
which in turn are transformed into Agrobacterium tumefaciens. 
Dicots will be transformed via the Agrobacterium harbouring the 
expression cassette and selection marker flanked by T-DNA 

25 following standard protocols (e.g. Akama et al., 1992. Plant 
Cell Reports 12: 7-11). The transfer of T-DNA from Agrobacterium 
to the Plant cell has been recently reviewed (Zupan & Zambryski, 
1995. Plant Physiol. 107: 1041-1047). Vectors for plant 
transformation via Agrobacterium are commercially available or 

30 can be obtained from many labs that construct such vectors (e.g. 
Deblaere et al., 1985. Nucleic Acids Res. 13: 4777-4788; for 
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review see Klee et'al., 1987. Annu. Rev. Plant Physiol. 38: 467- 
486) . 

Available plant promoters: Depending on the process under 
manipulation, organ- and/or cell-specific expression as well as 
5 appropriate developmental and environmental control may be 
required. For instance, it is desirable to express a phytase 
cDNA in maize endosperm etc. The most commonly used promoter has 
been the constitutive 35S-CaMV promoter Franck et al., 1980. 
Cell 21: 285-294). Expression will be more or less equal 

10 throughout the whole plant. This promoter has been used 
successfully to engineer herbicide- and pathogen-resistant 
plants (for review see Stitt & Sonnewald, 1995. Annu. Rev. Plant 
Physiol. Plant Mol. Biol. 46: 341-368). Organ-specific promoters 
have been reported for storage sink tissues such as seeds, 

15 potato tubers, and fruits (Edwards & Coruzzi, 1990. Annu. Rev. 
Genet. 24: 275-303), and for metabolic sink tissues such as 
meristems (Ito et al . , 1994. Plant Mol. Biol. 24: 863-878). 

The medium used to culture the transformed host cells may 
be any conventional medium suitable for growing the host cells 

20 in question. The expressed phytase may conveniently be secreted 
into the culture medium and may be recovered therefrom by well- 
known procedures including separating the cells from the medium 
by centrifugation or filtration, precipitating proteinaceous 
com-ponents of the medium by means of a salt such as ammonium 

25 sulphate, followed by chromatographic procedures such as ion 
exchange chromatography, affinity chromatography, or the like. 

Preferred host cells are a strain of Fusarium, Hansenula, 
Trichoderma or Aspergillus, in particular a strain of Fusarium 
graminearum, Fusarium venenatum, Fusarium cerealis, Fusarium sp. 

30 having the identifying characteristic of Fusarium ATCC 20334, as 
further described in PCT/US/95/07743, Hansenula polymorpha, 
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Trichoderma harzianum or Trichoderma reesei, Aspergillus niger 
or Aspergillus oryzae. 

References for expression in Hansenula polymorpha: 
Gellissen, G., Piontek, M., Dahlems, U., Jenzelewski, V., 
5 Gavagan, J.E., DiCosimo, R. , Anton, D.I. & Janowicz, Z.A. (1996) 
Recombinant Hansenula polymorpha as a biocatalyst: coexpression 
of the spinach glycolate oxidase (GO) and the S. cerevisiae 
catalase T (CTT1) gene. Appl . Microbiol. Biotechnol. 4 6, 4 6-54. 

Some more specific uses of the phytase variants according 

10 to the invention appear from PCT/DK97/00568, the last pages of 
the detailed description of the invention section. 

In a preferred embodiment, the phytase variant of the 
invention is essentially free of other non-phytase polypeptides, 
e.g., at least about 20% pure, preferably at least about 40% 

15 pure, more preferably about 60% pure, even more preferably about 
80% pure, most preferably about 90% pure, and even most 
preferably about 95% pure, as determined by SDS-PAGE. Sometimes 
such polypeptide is alternatively referred to as a "purified" 
and/or "isolated" phytase. 

20 A phytase polypeptide which comprises a phytase variant of 

the invention includes fused polypeptides or cleavable fusion 
polypeptides in which another polypeptide is fused at the N- 
terminus or the C-terminus of the polypeptide or fragment 
thereof. A fused polypeptide is produced by fusing a nucleic 

25 acid sequence (or a portion thereof) encoding another 
polypeptide to a nucleic acid sequence (or a portion thereof) 
encoding a phytase variant of the present invention. Techniques 
for producing fusion polypeptides are known in the art, and 
include, ligating the coding sequences encoding the polypeptides 

30 so that they are in frame and that expression of the fused 
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polypeptide is under control of the same promoter (s) and 
terminator. 

A "feed" and a "food," respectively, means any natural or 
artificial diet, meal or the like or components of such meals 
5 intended or suitable for being eaten, taken in, digested, by an 
animal and a human being, respectively. 

The phytase variant of the invention may exert its effect 
in vitro or in vivo, i.e. before intake or in the stomach of the 
individual, respectively. Also a combined action is possible. 
10 A phytase composition according to the invention always 

comprises at least one phytase of the invention. 

Generally, phytase compositions are liquid or dry. 
Liquid compositions need not contain anything more than 
the phytase enzyme, preferably in a highly purified form. 
15 Usually, however, a stabilizer such as glycerol, sorbitol or 
mono propylen glycol is also added. The liquid composition may 
also comprise other additives, such as salts, sugars, 
preservatives, pH-adjusting agents, proteins, phytate (a phytase 
substrate) . Typical liquid compositions are aqueous or oil-based 
20 slurries. The liquid compositions can be added to a food or feed 
after an optional pelleting thereof. 

Dry compositions may be spray-dried compositions, in which 
case the composition need not contain anything more than the 
enzyme in a dry form. Usually, however, dry compositions are so- 
25 called granulates which may readily be mixed with e.g. food or 
feed components, or more preferably, form a component of a pre- 
mix. The particle size of the enzyme granulates preferably is 
compatible with that of the other components of the mixture. 
This provides a safe and convenient means of incorporating 
30 enzymes into e.g. animal feed. 
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Agglomeration granulates are prepared using agglomeration 
technique in a high shear mixer (e.g. Lodige) during which a 
filler material and the enzyme are co-agglomerated to form 
granules. Absorption granulates are prepared by having cores of 
5 a carrier material to absorb/be coated by the enzyme. 

Typical filler materials are salts such as disodium 
sulphate. Other fillers are kaolin, talc, magnesium aluminium 
silicate and cellulose fibres. Optionally, binders such as 
dextrins are also included in agglomeration granulates. 
10 Typical carrier materials are starch, e.g. in the form of 

cassava, corn, potato, rice and wheat. Salts may also be used. 

Optionally, the granulates are coated with a coating 
mixture. Such mixture comprises coating agents, preferably 
hydrophobic coating agents, such as hydrogenated palm oil and 
15 beef tallow, and if desired other additives, such as calcium 
carbonate or kaolin. 

Additionally, phytase compositions may contain other 
substituents such as colouring agents, aroma compounds, 
stabilizers, vitamins, minerals, other feed or food enhancing 
20 enzymes, i.e. enzymes that enhances the nutritional properties 
of feed/food, etc. This is so in particular for the so-called 
pre-mixes. 

A "food or feed additive" is an essentially pure compound 
or a multi component composition intended for or suitable for 

25 being added to food or feed. In particular it is a substance 
which by its intended use is becoming a component of a food or 
feed product or affects any characteristics of a food or feed 
product. It is composed as indicated for phytase compositions 
above. A typical additive usually comprises one or more 

30 compounds such as vitamins, minerals or feed enhancing enzymes 
and suitable carriers and/or excipients. 
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In a preferred embodiment, the phytase compositions of the 
invention additionally comprises an effective amount of one or 
more feed enhancing enzymes, in particular feed enhancing 
enzymes selected from the group consisting of ot-galactosidases, 
5 p-galactosidases, in particular lactases, other phytases, p- 
glucanases, in particular endo-p-1, 4-glucanases and endo-p- 
1, 3 (4) -glucanases, cellulases, xylosidases, galactanases, in 
particular arabinogalactan endo-1, 4-p-galactosidases and 
arabinogalactan endo-1, 3-p~galactosidases, endoglucanases, in 

10 particular endo-1, 2-p-glucanase, endo-1, 3-a-glucanase, and endo- 
1, 3-p-glucanase, pectin degrading enzymes, in particular 
pectinases, pectinesterases, pectin lyases, polygalacturonases, 
arabinanases , rhamnogalacturonases , rhamnogalacturonan acetyl 
esterases, rhamnogalacturonan-a-rhamnosidase, pectate lyases, 

15 and a-galacturonisidases, mannanases, p-mannosidases, mannan 
acetyl esterases, xylan acetyl esterases, proteases, xylanases, 
arabinoxylanases and lipolytic enzymes such as lipases, 
phospholipases and cutinases. 

The animal feed additive of the invention is supplemented 

20 to the mono-gastric animal before or simultaneously with the 
diet. Preferably, the animal feed additive of the invention is 
supplemented to the mono-gastric animal simultaneously with the 
diet. In a more preferred embodiment, the animal feed additive 
is added to the diet in the form of a granulate or a stabilized 

25 liquid. 

An effective amount of phytase in food or feed is from 
about 10-20.000; preferably from about 10 to 15.000, more 
preferably from about 10 to 10.000, in particular from about 100 
to 5.000, especially from about 100 to about 2.000 FYT/kg feed 
30 or food. 
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Examples of other specific uses of the phytase of the 
invention is in soy processing and in the manufacture of 
inositol or derivatives thereof. 

The invention also relates to a method for reducing 
5 phytate levels in animal manure, wherein the animal is fed a 
feed comprising an effective amount of the phytase of the 
invention. 

Also comprised in this invention is the use of a phytase 
of the invention during the preparation of food or feed 

10 preparations or additives, i.e. the phytase exerts its phytase 
activity during the manufacture only and is not active in the 
final food or feed product. This aspect is relevant for instance 
in dough making and baking. 

The invention relates to a phytase variant which, when 

15 aligned according to Fig. 1, is amended as compared to a model 
phytase in at least one of the following positions, using the 
position numbering corresponding to P_lycii: 

24; 27; 31; 33; 39; 40; 41; 42; 43; 44; 45; 46; 47; 49; 51; 56; 

58; 59; 61; 62; 68; 69; 70; 71; 72; 73; 74; 75; 76; 77; 78; 79; 
20 80; 81; 82; 83; 84; 88; 90; 102; 115; 116; 117; 118; 119; 120; 

121; 122; 123; 124; 125; 126; 127; 128; 132; 143; 148; 149; 150; 

151; 152; 153; 154; 155; 156; 157; 158; 159; 160; 161; 162; 163; 

170f; 170g; 171; 172; 173; 184; 185; 186; 187; 187a; 190; 191; 

192; 193; 194; 195; 198; 199; 200; 201; 201a; 201b; 201c; 201d; 
25 201e; 201f; 202; 203; 203a; 204; 205; 211; 215; 220; 223; 228; 

232; 233; 234; 235; 236; 237; 238; 239; 242; 243; 244; 246; 

251e; 253; 256; 260; 264; 265; 267; 270; 271; 272; 273; 274; 

275; 276; 277; 278; 279; 280; 283; 285; 287; 288; 292; 293; 302; 

304; 332; 333; 334; 335; 336; 337; 338; 339; 340; 341; 342; 343; 
30 348; 349; 352; 360; 362; 364; 365; 366; 367; 368; 369; 370; 371; 
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372; 373; 374; 375; 376; 383k; 387; 393; 394; 396; 404; 409; 
411; 412; 413; 417; 421; 431. 

From these variants we expect amended characteristics, 
preferably amended activity characteristics. In fact, for 
5 several variants such amended characteristics have already been 
shown (see the experimental part) . Like above, "amended" means 
as compared to the model phytase. "Amended activity 
characteristics" means amended in at least one phytase activity 
related respect, such as (non-exclusive list) : pH stability, 

10 temperature stability, pH profile, temperature profile, specific 
activity (in particular in relation to pH and temperature) , 
substrate specificity, substrate cleavage pattern, substrate 
binding, position specificity, the velocity and level of release 
of phosphate from corn, reaction rate, phytate degradation 

15 rate) , end level of released phosphate reached. 

Preferred amended activity characteristics are amended 
specific activity, preferably increased, and preferably 
increased at a pH of 3, 4, 5, or 6; amended pH or temperature 
profile; and/or amended, preferably increased, thermostability, 

20 e.g. of an increased melting temperature as measured using DSC. 

Preferred phytase variants are: Phytase variants which, 
when aligned according to Fig. 1, are amended as compared to a 
model phytase in at least one of the following positions, using 
the position numbering corresponding to P_lycii: 

25 43; 44; 47; 51; 58; 62; 78; 80; 83; 88; 90; 102; 143; 148; 153; 
154; 186; 187a; 195; 198; 201e; 204; 205; 211; 215; 220; 242; 
244; 251e; 260; 264; 265; 267; 270; 273; 278; 302; 336; 337; 
339; 352; 365; 373; 383k; 404; 417. 

The following variants of A_fumigatus constitute a 

30 subgroup: Q43L; Q270L; G273D,K; N336S; A205E; Y278H; Q43L+Q270L; 



WO 99/49022 



PCT/DK99/00153 



43 

Q4 3L+Q270L+G273D; Q4 3L+Q270L+G273D+N336S; G273K+A205E; 

G273K+A205E+Y278H (see EP 0897010). 

Generally, variants of the invention can be deduced or 
identified as follows: Looking at the alignment according to 
5 Fig. 1, comparing two sequences, one of which is a model phytase 
with improved properties, identifying amino acid differences in 
relevant positions/areas, and transferring (substituting with) 
from the model to the other phytase sequence the amino acid in a 
relevant position. 

10 The invention also relates to a process for preparing a 

phytase variant which includes the above method, and further 
includes the deducement and synthesis of the corresponding DNA 
sequence, the transformation of a host cell, the cultivation of 
the host cell and the recovery of the phytase variant. 

15 Relevant positions/areas include those mentioned below in 

relation to important phytase activity characteristics such as 
specific activity, thermostability, pH activity/stability. 

The present invention also relates to phytase variants 
(varied according to a model phytase as defined herein) which 

20 are obtainable, preferably obtained, by the process outlined 
above and which are expected to exhibit an amended 
characteristic/property, preferably does exhibit such amended 
characteristic, e.g. an improved specific activity. 

At least the basidiomycete model phytases P_lycii and 

25 T_pubescens exhibit a high specific activity (as determined 
using the method of Example 2 herein) . 

This is an example of a desired property which can be 
transferred to other phytases, e.g. the other phytases listed in 
Fig. 1, in particular to the A__pediades and the ascomycete 

30 phytases such as A_fumigatus, A-ficuum, consphyA, by a 
deducement process such as the one mentioned above. 
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Thus, amended specific activity, in particular an improved 
specific activity, in particular at low pH and/or high 
temperature, is expected from variants, which have been amended 
in relevant areas, viz. (i) in the amino acid residues which 
5 point into the active site cleft; or (ii) in the amino acid 
residues in the close neighbourhood of these active site 
residues. Preferably, close neighbourhood means within 10A from 
the active site residues. 

From the pdb file HHP (Brookhaven Database entry of 

10 18.03.98 re HHP, Structure of Phosphomonoesterase, D.Kostrewa; 
or as published in Nature Structural Biology, 4, 1997, p. 185- 
190), active site regions can be identified, using the program 
INSIGHTII from Molecular Simulations MSI, San Diego, California, 
and using the subset command, an "active site shell" can be 

15 defined comprising those amino acid residues which lie close to 
the catalytic residues, defined as H59, D339 and R58 in A. 
ficuum phytase (corresponding to Peniophora numbers H71, D335 
and R70, respectively). An "active site shell (10A)" comprises 
those residues which lie within 10A from the above catalytic 

20 residues. 

The residues within 10A from H71 and D335 are the 
following (using Peniophora numbers) : 41-47, 68-77, 115-118, 
120-126, 128, 149-163, 185, 191-193, 199, 243, 270-271, 273-275, 
277-279, 288, 332-343, 364-367, 369-375, 394 ("the active site 
25 shell (10A) ") . 

Preferably, a "substrate binding shell" can also be 
defined which comprises those residues which are in close 
proximity to the substrate binding site and which can therefore 
be expected to be in contact with the substrate. 
30 This information can be deduced as described above, by 

docking a sugar analogue to phytin into the active site cleft 
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(the residues making up the surface of the active site) . If a 
sugar without any phosphate groups is docked into the active 
site cleft, e.g. alpha-D-glucose (chair conformation, structure 
provided by the INSIGHTII program) , using a fixed distance as 
5 shown below, the residues pointing towards the active site cleft 
can be extracted using the subset command and using a distance 
of 10A from the substrate analogue. Alternatively, the compound 
inositol-1, 4, 5-triphosphate (Brookhaven database file ldjx. 
Inositol-1, 4, 5-triphosphate) can be docked into the active site 
10 cleft. This compound and glucose, however, are more or less 
superimposable . 

The distances in Angstrom (A) are: From oxygen atom in 
position 6 of the alpha-D~glucose to 



atom ND1 of H59: 5.84 

15 atom NH2 of R58: 6.77 

atom NH2 of R142: 5.09 

atom ND2 of N340: 3.00 

atom ND1 of H59:' 7.76 

atom NH2 of R58: 8.58. 



20 (the Peniophora numbers of the above residues are: H71, 

R70, R155, N336, H71 and R70, respectively) . 

In this way, the residues in contact with the substrate 

are identified as follows (Peniophora numbers): 43-44; 70-80; 

83-84; 115; 153; 155-156; 184; 191-192; 198-202; 205; 235; 238; 
25 242; 270; 272-273; 275-277; 332-336; 338; 369; 371 ("the 

substrate binding shell (10A) ") . 

Variants being amended in one or more of (1) the active 

site shell or (2) the substrate binding shell, are strongly 

expected to have an amended specific activity. This leads to the 
30 following joint grouping of positions (still Peniophora numbers 

and 10A shells): 41-47, 68-80, 83-84, 115-118, 120-126, 128, 
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149-163, 184-185, 191-193, 198-201e, 202-203, 205, 235-236, 238- 
239, 242-243, 270-279, 285, 288, 332-343, 364-367, 369-375, 394. 

Preferably, the active site shell and the substrate 
binding shell are defined as described above using the 
5 basidiomycete model phytases of Fig. 1, the Peniophora phytase 
being a preferred model. A deducement of corresponding variants 
of other model phytases is possible using the alignment of Fig. 
1. 

In a preferred embodiment, a distance of 5A is used in the 
10 subset command, thus defining active site and substrate binding 
shells of a more limited size, e.g. an active site shell 
comprising the residues 43-44, 69-74, 117, 125, 155-156, 159, 
274, 332-340, 370-374 (5A from H71 and D335) , "active site 
shell (5A) n . 

15 Generally the active site shell and substrate binding 

shell regions form the basis for selecting random mutagenesis 
regions. Examples of preferred random mutagenesis regions are 

regions 69-74, 332-340, 370-374, doping to be added (a 5A 
approach) ; and 

20 regions 57-62, 142-146, 337-343, doping to be added (a 10A 

approach) . 

It is presently contemplated that any amendment in either 
of these positions will lead to a phytase of amended 
characteristics, e.g. of an amended specific activity. 
25 The above expression "any amendment in either of the 

positions'' is considered fully equivalent to listing each 
position and each substitution, e.g. as follows for the above 
sub-group 41-47: 

41A,C, D,E,F,G,H,I,K,L,M,N, P, Q, R, S, T, V, W, Y; 
30 42A,C, D,E,F,G,H, I , K, L, M, N, P, Q, R, S , T, V, W, Y; 
43A,C, D,E, F,G,H,I,K,L,M,N, P, Q, R, S , T, V, W, Y ; 
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44A / C, D,E, F,G,H,I,K,L,M,N, P,Q, R, S f T, V,W, Y; 

45A,C, D,E, F, G, H, I,K,L,M,N,P,Q,R,S,T,V,W,Y; 

4 6A,C, D, E, F,G,H, I,K, L,M, N f P,Q,R,S,T,V,W,Y; 

47A / C / D / E f F / G / H / I r K f L f M f N,P / Q / R,S / T f V f W f Y. 
5 In a preferred embodiment , amended specific activity is 

expected from the following variants: 

4 2 S , G ; 4 3A,C, D,E f F,G / H / I,K / L f M,N f P,Q f R / S / T f V f W r Y; 45D,S; 

47Y,F; 51E,A; 75W,F; 78S,D; 79G; 80K,A; 831,0; 84Q,V; 116S; 

118V, L; 119E; 120L; 122A; 123N,T; 125S; 126H / S; 127Q,E; 128A,T; 
10 151A,S; 152G; 153D,Y; 1540,0,6; 157V; 158D,A; 159T; 160A,S; 

161T,N; 162N; 163W; 184Q,S; 186A,E; 198A,N; 200G,V; 201D; 

deletions of one or more of 201a, 201b, 201c, 201d, 201e, 201f - 

preferably all; 202S; 205Q,E; 235Y,L; 238L,M; 242P; 270Y,A,L; 

271D; 273D,K; 275F,Y; 278T,H; 332F; 336S; 337T,Q; 339V; 340P,A; 
15 343A,S; 364W,F; 365V, L; 366D,V; 367K; 368K; 3691, L; 370V; 373S; 

374A; 375H; 376M; 393V. 

Particularly preferred variants are the following: 78S; 

79G; 80A; 831, Q; 84Q,V; 198A,N; 200G,V; 201D; deletions in one 

or more of 201a, 201b, 201c, 201d, 201e, 201f - preferably all 
20 deletions; 202S; 205Q,E; 235Y,L; 238L,M; 242P, 273D; 275F,Y. 

Other particularly preferred variants are the following: 

43A,C,D,E, F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W, Y; in particular 43M,P; 

75W,F; 80K; 153D; 184Q,S; 270Y,A; 332F; 3691, L. 

The following variants are especially preferred: 
25 43L,G,N,V,A, I,T; 78D; 153Y; 154G; 270L; 273D,K. Double and 

triple variants (43L/270L) ; ( 43L/270L/273D) ; (43L/78D) and 

(43L/153Y/154G) are also especially preferred. Other preferred 

variants are 205E; 278H; 336S. 

These especially preferred single, double and triple 
30 variants are preferably variants of model phytases which can be 
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aligned to Fig.l, in particular variants of the specific model 
phytases listed in Fig. 1. 

At least consphyA is known to have a high thermostability. 
Still further, the thermostability of P__lycii is rather high. 
5 This is an example of a desired property which can be 

transferred to other phytases, e.g. the other phytases listed in 
Fig. 1, in particular to the basidiomycete phytases such as 
P_lycii and A__pediades, by a deducement process such as the one 
mentioned above. 

10 Amended thermostability, in particular improved 

thermostability, is expected on this background from the 

following variants: 

39H,S; 40L,N; 43P; 47Y,F; 49P; 51E,A; 56P; 58D; 61R; 62V; 

80K; 83A; 84Y; 172P; 184P; 195T; 198A; 204V; 211L; 223D; 236Y; 
15 242P; 246V; 253P; 264R; 265Q; 280A, P; 283P; 287A; 292F,Y; 293A; 

302R; 304P; 337S; 348Y; 387P; 396R; 409R; 411K; 412R; 417E; 

421F, Y. 

The following variants of amended thermostability are 
particularly preferred: 39S; 40N; 47Y,F; 51A; 83A; 195T; 204V; 
20 211L; 242P; 265A. 

Further variants of amended thermostability are the 

following: 42G; 43T,L,G; 44N; 58K,A; 59G; 621; 69Q; 75F; 78D; 

79G; 80A; 81A,G; 82T; 83K,R; 841; 881; 90R,A; 102Y; 115N; 118V; 

122A; 123Q,N; 125M,S; 126V, S; 127N,Q; 128S,A; 143N,K; 148V, I; 
25 154S; 158D; 170fH; 170gA; 171T,N; 172N; 173W, 184S; 186A; 187A; 

187aS,\ 193S; 195V, L; 198V; 201E; 201eT; 202A, 203aT; 204A; 211V, 

215P,A; 220L,N; 223H; 228N; 232T; 322E; 235T; 236N; 242S; 244D; 

251eQ,E; 256D; 2641; 260A,H; 265A; 267D; 270G; 271D; 273K,D; 

278T,H; 287T; 293V; 302H; 337T,G; 3381; 339V, I; 340A; 352K; 
30 365A,S; 366S; 367A; 369L; 373S,A; 374S; 376M; 383kE,Q; 404G,A; 

411T; 417R; 431E. 
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Other concepts of the invention, which can be expected to 
impart an improved thermostability to a phytase, are as follows 
- considering the HHP structure previously referred to and 
transferring via an alignment according to Fig. 1 as outlined 
5 herein: 

(A) Introduction of prolin residues in spatial positions where 
the prolin special dihedral angles are satisfied and the 
hydrogen bonding network are not hampered and no steric clashes 
are observed. 

10 (B) Filling up holes: By substitution for bigger residues in 
internal cavities an improvement in stability can often be 
obtained. 

(C) Cystin bridge: Cystin bridges will often make the proteins 
more rigid and increase the energy of unfolding. 
15 Further variants from which amended thermostability is 

expected according to these concepts of (A) to (C) are: 27P, 
31Y, 132F, 1321, 132L, 184P, 186P, 190P, 280P, 343F, 3431, 343L, 
349P, 362P and (33C and 24C) . 

Concept (A): 27P, 184P, 186P, 190P, 349P, 362P. 
20 Concept (B) : 343F,I,L; 31Y; 132F,I,L; 273F. 

Concept (C) : 33C/24C. 

Amended pH activity or stability, preferably stability, in 
particular at low pH, in particular improved, is another desired 
property which can be transferred by aligning according to Fig. 

25 1 and transferring from models of improved pH profiles to other 
phytases - as outlined above. 

Other concepts of the invention, which can be expected to 
impart an improved stability at low pH to a phytase, are as 
follows - considering the HHP structure previously referred to 

30 and transferring via an alignment according to Fig. 1 as 
outlined herein: 
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(D) Surface charges: Better distribution at low pH, to avoid 
cluster of negative or positive, and to avoid too close same 
charged residues. 

(E) Prevent deamidation : Surface exposed Q or N in close 
5 contact to negative charged residues. 

Phytase variants having improved pH stability/activity at 
low pH are expected to be: 39H; 39Q; 80A; 203R; 271N; 51R; 154S; 
185S; 194S; 194T; 288L; 2881; 288F; 360R; 173Q,S; 204Q,S; 
303K,S; 81Q,E. 

10 Concept (D): 203R, 271N, 51R, 185S, 360R; 173Q,S; 204Q,S; 

303K,S; 81Q,E. 

Concept (E) : 154S; 194S,T; 288L,I,F. 

A preferred model phytase for these concepts of (D) and 
(E) is P_lycii. 

15 Experimentally proven to have a lowered pH optimum is: 

Variant 80A of ascomycete phytases, in particular of A_fumigatus 
and consphyA. 

Especially preferred single, double and triple variants 
are 43L; (43L/270L) and (43L/270L/273D) . These variants have a 
20 changed pH profile. They are preferably variants of the specific 
model phytases listed in Fig. 1. 

For all preferred variants listed above: 

the stability is preferably amended at high temperature, 
viz. in the temperature range of 50-100 °C, in particular 60- 
25 90°C, more preferably in the range of 70-90°C; 

the activity is preferably amended in a temperature range 
relevant for the use in the gastro-intestinal system of animals, 
e.g. 30-40°C, more preferably 32-38°C, most preferably in the 
range of 35-38°C; 

30 the stability is preferably amended at low pH, viz. in the 

pH range of pH 1.5-7, preferably 2-6, more preferably 3-5; 
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the activity is preferably amended in the pH range of pH 
1.5-5.5, more preferably at pH 2.5-4.5, still more preferably 3- 
5. 

Tests for amended phytase characteristics, such as those 
5 mentioned above, are well known in the art and any such test can 
be used to compare the performance of the phytase variants with 
the phytase models. 

A preferred test for specific activity is given in Example 
2. Preferred tests for pH and temperature activity and stability 
10 are given in Example 3. An even more preferred test for thermal 
stability is the DSC method of Example 4. 

WO 98/28409 discloses tests for various other parameters, 
too, such as position specificity. All the tests of WO 98/28409 
are preferred tests. 
15 Generally, of course all these tests can be conducted at 

desired pH values and temperatures. 

In the dependent claims, some preferred phytase variants 
based on five of the thirteen herein specifically disclosed 
model phytases are specified. 
20 In an analogous way other preferred variants based on the 

remaining eight specifically disclosed model phytases can easily 
be deduced by combining the suggested amendments with each of 
the corresponding sequences of Fig. 1. These preferred variants 
are specifically included in the present invention, and they are 
25 easily deducemed, viz. the following: 

Variants of a model phytase derived from Paxillus, 
preferably Paxillus involutus, preferably derived from strain 
CBS 100231, preferably variants of P_involtus-Al, the sequence 
of which is shown at Fig. 2, said variants comprising at least 
30 one of the following amendments: 

024C; T27P; F31Y; I33C; R39H,S,Q; N40L; S42G; 
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P43A,C,D,E,F,G,H,I,K,L,M / N,Q,R / S,T,V,W,Y; Y44N; S45D; 

Y47F; A51E,R; A58D;K; Q61R; I62V; F75W; S78D; A80K; T81Q,E,G,A; 

R83A,I,Q,K; I84Y,Q,V; L88I; K90R,A; F102Y; S115N; D116S; V118L; 

P119E; F120L; A123N,T,Q; S125M; F126H,S,V; D127Q,E,N; A128T,S; 
5 A132F,I,L; I148V; D151A,S; S153D,Y; D154Q,S,G; D158A; S159T; 

A160S; T161N; ()170fH; ()170gA; S171N; H172P; N173Q,S; P184Q,S; 

Q185S; T186A,E,P; G187A; ()187aS; T190P,A; D193S; N194S,T; 

M195T,V,L; A198N,V; G200V; D201E; ()201eT; S202A; D203R,K,S; 

P203aV,T; Q204E,S,A,V; V205E; V211L; S215A,I; L220N; A223D,H; 
10 D233E; F235Y,L,T; N236Y; L237F; V238L,M; A242P,S; M244D; 

()251eE,Q; D253P; T256D; P260A,H; E264R,I; A265Q; A267D; 

G270Y,A,L; D271N; D273K; F275Y; T278H; Y280A,P; E283P; V287A,T; 

Q288L,I,F; Y292F; V293A; N302R,H; A304P; N336S; L337T,Q,S,G; M 

3381; V339I; A340P; S343A, F,I,L; F348Y; R349P; A352K; P360R; 
15 R362P; W364F; R365V,L,A,S; T366D,V,S; S367K,A; S368K; L369I; 

S373A; G374A,S; R375H; ()383kQ,E; T387P; Q396R; G404A; L409R; 

T411K; L412R; E417R; F421Y. 

Variants of a model phytase derived from a species of the 

genus Paxillus, preferably the species Paxillus involutus, 
20 preferably derived from strain CBS 100231, preferably variants 

of P_involtus-A2, the sequence of which is shown at Fig. 3, said 

variants comprising at least one of the following amendments: 

P24C; I27P; F31Y; I33C; R39H,S,Q; N40L; S42G; 

P43A,C,D,E,F,G,H / I,K,L,M,N,Q,R,S,T,V,W,Y; Y44N; S45D; 
25 Y47F; A51E,R; A58D,K; E61R; I62V; F75W; S78D; A80K; A81Q,E,G; 

R83A, I,Q,R,K; I84Y,Q,V; L88I; K90R,A; F102Y; S115N; D116S; 

V118L; P119E; F120L; A123N,T,Q; S125M; F126H,S,V; D127Q,E,N; 
A128T,S; V132F,I,L; D143N; I148V; D151A,S; S153D,Y; D154Q,S,G; 

D158A; A160S; T161N; ()170fH; ()170gA; S171N; R172P; N173Q,S; 
30 P184Q,S; Q185S; T186A,E,P; G187A; ()187aS; T190P,A; D193S; 
N194S,T; M195T,V,L; A198N,V; G200V; E201D; ()201eT; S202A; 
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D203R,K,S; P203aV,T; Q204E,S,A,V; V205E; S211L,V; S215A, P; 

L220N; A223D, H; A232T; F235Y,L,T; N236Y; L237F; V238L,M; P242S; 

M244D; ()251eE,Q; D253P; T256D; P260A,H; E264R,I; A265Q; A267D; 

G270Y,A,L; D271N; D273K; F275Y; T278H; Y280A,P; A283P; V287A,T; 
5 Q288L,I,F; Y292F; I293A,V; N302R,H; A304P; N336S; L337T,Q,S,G; 

M338I; V339I; 340P,A; A343S, F, I, L; F348Y; R349P; A352K; P360R; 

R362P; W364F; L365V,A,S; T366D,V,S; S367K,A; S368K; V369I,L; 

S373A; R375H; ()383kQ,E; T387P; Q396R; G404A; L409R; A411K, T; 

L412R; E417R; Y421F. 
10 Variants of a model phytase derived from a species of the 

genus Trametes, preferably the species Trametes pubescens, 

preferably derived from strain CBS 100232, preferably variants 

of T_pubescens, the sequence of which is shown at Fig. 4, said 

variants comprising at least one of the following amendments: 
15 R24C; T27P; L31Y; V33C; Q39H,S; S40L,N; S42G; 

M43A,C,D,E,F,G,H,I,K,L,N,P,Q, R,S,T,V,W,Y; Y44N; S45D; 

Y47F; A51E,R; A58D,K; S59G; Q61R; I62V; F75W; S78D; A80K; 

A81Q,E,G; R83A, I,Q,K; I84Y,Q,V; V88I; K90R,A; L102Y; D115N; 

V118L; T123N,Q; S125M; S126H,V; E127Q,N; A128T,S; A132F, I,L; 
20 D143N; V148I; S151A; S153D,Y; D154Q,S,G; A158D; A160S; N161T; 

()170fH; ()170gA; S171N; S172P; N173Q,S; S184Q,P; E185S; 

A186E,P; G187A; ()187aS; T190P,A; N194S,T; M195T,V,L; A198N,V; 

G200V; ()201eT; S202A; D203R,K,S; P203aV,T; Q204E,S,A,V; V205E; 

Q211L,V; P215A; L220N; G223D,H; D233E; Y235L,T; N236Y; L237F; 
25 L238M; P242S; E244D; ()251eE,Q; E253P; Q260A,H; D264R,I; A265Q; 

A267D; A270Y,L,G; D271N; D273K; F275Y; T278H; Y280A,P; V287A, T; 

Q288L,I,F; Y292F; I293A,V; A302R,H; N304P,A; N336S; Q337T,S,G; 

M338I; V339I; A340P; S343A,F,I,L; F348Y; N349P; A352K; P360R; 

R362P; F364W; L365V,A,S; V366D,S; K367A; I369L; A373S; A374S; 
30 R375H; ()383kQ,E; Q387P; A396R; G404A; V409R; T411K; L412R; 

E417R; Y421F. 
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Variants of a model phytase derived from a species of the 

genus Aspergillus, preferably the species Aspergillus nidulans, 

preferably derived from strain DSM 9743, preferably variants of 

A_nidulans, the sequence of which is shown at Fig. 10 , said 
5 variants comprising at least one of the following amendments: 

V24C; A27P; H39S,Q; V40L,N; G42S; 

Q4 3A,C, D,E, F,G,H, I,K,L,M,N, P, R, S , T, V, W, Y; Y4 4N; S4 5D; 

Y47F; S49P; E51A, R; V56P; H58D,K,A; E61R; V62I; S69Q; Y75W,F; 

E78D,S; S79G; K80A; S81Q,E,A,G; K82T; A83I,Q,K,R; Y84Q,V,I; 
10 A90R; D115N; D116S; T118V,L; I119E; F120L; E122A; N123T,Q; 

M125S; V126H / S; D127Q,E,N; S128A,T; F132I,L; K143N; I148V; 

S151A; S153D,Y; D154Q, S,G; A158D; S159T; A160S; E161T,N; K162N; 

F163W; G170fH; S170gA; ()171N; ()172P; K173Q f S; P184Q,S; E185S; 

I186A,E,P; D187A; G187aS; T190P,A; H193S; S194T; S198A,N,V; 
15 E200G,V; N201D,E; D201e(); E201e(),T; R201f() (a deletion of at 

least one of 201d, 201e, 201f, preferably all); A202S; 

D203R,K,S; E203aV,T; I204Q,E, S,A, V; I211L, V; P215A; L220N; 

D223H; K228N; E232T; N233E; I235Y,L,T; Y236N; L237F; M238L; 

S242P; M246V; E251eQ; A256D; E260A / H; L264R, I; Q270Y, A, L, G; 
20 S271D,N; S273D,K; Y275F; G278T,H; A280P; A287T; Q288L,I,F; 

F292Y; T293A,V; Q302R,H; P304A; N336S; S337T,Q,G; M338I; I339V; 

S340P,A; F343A,S,I,L; N349P; Q352K; S360R; Q362P; Y364W,F; 

A365V,L,S; A366D,V,S; S367K,A; W368K; T369I,L; G373S,A; A374S; 

R375H; A376M; E383kQ; A404G; T411K; L412R; E417R; F421Y; K431E. 
25 Variants of a model phytase derived from a species of 

Aspergillus, preferably Aspergillus terreus, preferably derived 

from strain CBS 220.95, preferably variants of A_terreus, the 

sequence of which is shown at Fig. 12, said variants comprising 

at least one of the following amendments: 
30 G24C; V27P; H39S,Q; K40L,N; G42S; 
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L43A,C,D,E,F,G,H,I,K,M,N,P,Q,R,S,T,V,W,Y; Y44N; A45D,S; 

Y47F; S49P; Q51E,A,R; V56P; P58D,K,A; D59G; H61R; I62V; A69Q; 

S75W f F; H78D,S; S79G; K80A; T81Q,E,A,G; A83I,Q,K,R; Y84Q,V,I; 

A90R; E115N; E116S; T118V,L; P119E; F120L; R122A; N123T,Q; 
5 L125S,H; R126H,S,V; D127Q,E,N; L128A,T,S; F132I,L; H143N; V148I; 

T151A,S; D152G; A153D,Y; S154D,Q,G; H157V; E158D,A; S159T; 

A160S; E161T,N; K162N; F163W; H173Q,S; P184Q,S; E185S; 

G186A,E,P; S187A; A187aS; T190P,A; H193S; S194T; L195T,V; 

A198N,V; E200G,V; S201D,E; S201d(); T201e(); V201f(); G202S,A; 
10 D203R,K,S; D203aV,T; A204Q,E,S,V; V205E; V211L; A215P; L220N; 

D223H; Q228N; D232T; D233E; V235Y,L,T; N236Y; L237F; M238L; 

P242S; E244E; T251eE,Q; A260H; T264R, I; Q265A; N267D; L270Y,A,G; 

S271D f N; K273D; Y275F; H278T; G280A,P; V287A,T; Q288L,I,F; 

W292F,Y; A293V; Q302H; P304A; N337T,Q,S,G; L338I; V339I; 
15 S340P,A; W343A, S, F, I, L; N349P; A352K; S360R; S362P; Y364W,F; 

A365V,L,S; A366D,V, S; A367K; W368K; T369I,L; A373S; A374S; 

R375H; A376M; R383kQ,E; P404A,G; K411T; A417E,R; F421Y; A431E. 

Variants of a model phytase derived from a species of 

Talaromyces, preferably the species Talaromyces thermophilus, 
20 preferably derived from strain ATCC 20186 or ATCC 74338, 

preferably variants of T_thermo, the sequence of which is shown 

at Fig. 13, said variants comprising at least one of the 

following amendments : 

H24C; V27P; H39S,Q; S40L,N; G42S; 
25 Q43A,C,D,E,F,G,H,I,K,L,M,N,P,R,S,T,V,W, Y; Y44N; S45D; 

F47Y; S4 9P; A51E,R; V56P; Q58D,K,A; N59G; K61R; I62V; Y75W, F; 

S78D; S79G; K80A; T81Q,E,A,G; E82T; L83A, I , Q, R, K; Y84Q,V,I; 

R90A; D116S; T118V,L; P119E; F120L; E122A; N123T,Q; M125S; 

I126H,S,V; Q127E,N; L128A, T, S; F132I,L; V148I; S151A; S153D,Y; 
30 D154Q,S,G; I157V; A158D; S159T; G160A, S; R161T,N; L162N; F163W; 
S170gA; D171N; K172P; H173Q,S; E184Q,S,P; E185S; G186A,E,P; 
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D187A; T190P,A; T193S; G194S,T; S195T,V,L; V198A,N; E200G,V; 

D201E; S201d(); S201e(),T; S201f(); G202S,A; H203R,K,S; 

D203aV,T; A204Q,E,S,V; Q205E; Q211L,V; A215P; I220N,L; H223D; 

D228N; S232T; D233E; P235Y,L,T; Y236N; M237F; D238L,M; P242S; 
5 E244D; L246V; ()251eE,Q; A256D; Q260A,H; Q264R,I; A265Q; 

Q270Y,A,L,G; S271D,N; G273D,K; Y275F; N278T,H; G280A, P; A287T; 

Q288L,I,F; F292Y; V293A; H302R; P304A; N336S; T337Q,S,G; M338I; 

T339V, I; S340P,A; A343S,F,I,L; N349P; A352K; S360R; E362P; 

Y364W, F; S365V,L,A; A366D,V,S; A367K; W368K; T369I,L; G373S,A; 
10 G374A, S; R375H; A376M; D383kQ,E; E404A; K411T; R417E; F421Y. 

Variants of a model phytase derived from a species of 

Thermomyces, preferably the species Thermomyces lanuginosus, 

preferably derived from strain DBS 58 6.94, preferably variants 

of T_lanuginosa, the sequence of which is shown at Fig. 14, said 
15 variants comprising at least one of the following amendments: 

K24C; ()27P; ()31Y; ()33C; R39H,S,Q; H40L,N; G42S; 

Q43A,C,D,E,F,G,H, I , K, L, M, N, P, R, S , T, V, W, Y; Y44N; S45D; 

F47Y; S49P; A51E,R; V56P; K58D,A; V62I; S69Q; Y75W,F; A78D, S; 

H79G; K80A; S81Q,E,A,G; E82T; V83A, I , Q, K, R; Y84Q,V,I; L88I; 
20 R90A; F102Y; D115N; N116S; T118V,L; R119E; F120L; E122A; 

E123N,T,Q; M125S; M126H,S,V; E127Q,N; S128A,T; F132I,L; E143N; 

V148I; A151S; S153D,Y; A154D, Q, S, G; I157V; A158D; S159T; A160S; 

E161T,N; F162N; F163W; R170fH; S170gA; K172P; D173Q,S; S184Q, P; 

E185S; E186A, P; T187A; G187aS; T190P,A; G193S; L194S,T; T195V,L; 
25 A198N,V; E200G,V; E201D; A201d(); P201e(),T; D202S,A; P203R,K,S; 

T203aV; Q204E,S,A,V; P205E; V211L; R215A,P; I220L,N; H223D; 

E232T; D233E; P235Y,L,T; L236Y,N; M238L; P242S; Q251eE; H256D; 

Q260H; M264R, I; A265Q; Y270A,L,G; T271D,N; D273K; Y275F; H278T; 

G280A,P; A283P; S287A; R288L,I,F; F292Y; V293A; G302R,H; P304A; 
30 N336S; T337Q,S,G; M338I; T339V,I; G340P,A; S343A,F,I,L; N349P; 

P360R; T362P; Y364W, F; A365V,L,S; A366D,V,S; S367K,A; W368K; 
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T369I,L; A373S; A374S; R375H; A376M; E383kQ; R404A,G; R411K,T; 
K417E,R; F421Y; D431E. 

Variants of a model phytase derived from a species of 
Myceliophthora, preferably the species Myceliophthora 
5 thermophila, preferably derived from strain ATCC 48102 or ATCC 
74340, preferably variants of M_thermophila, the sequence of 
which is shown at Fig. 7, said variants comprising at least one 
of the following amendments: 
S24C; F31Y; H39S,Q; F40L,N; G42S; 
10 Q43A,C,D,E,F,G,H,I,K,L,M,N,P,R,S,T,V,W,Y; Y44N; S45D; 

Y47F; S49P; P51E,A,R; I56P; D58K,A; D59G; E61R; V62I; S69Q; 
A75W,F; L78D,S; K79G; R80K,A; A81Q,E,G; A82T; S83A, I , Q, K, R; 
Y84Q,V,I; R90A; D115N; E116S; T118V,L; R119E; T120L; Q122A; 
Q123N,T; M125S; V126H,S; N127Q,E; S128A,T; F132I,L; K143N; 
15 V148I; A151S; Q153D,Y; D154Q,S,G; H158D,A; S159T; A160S; 
E161T,N; G170fH; S170gA; T171N; F163W; V172P; R173Q,S; P184Q,S; 
E185S; T186A, E,P; G187aS; T190P,A; N193S; D194S,T; L195T,V; 
A198N,V; E200G,V; E201D; G201a(); P201b(); Y201c(); S201d(); 
T201e(); I201f(); G202S,A; D203R,K,S; D203aV,T; A204Q, E, S, V; 
20 Q205E; T211L,V; P215A; V220N,L; N223D,H; A232T; D233E; 
V235Y,L,T; A236Y,N; L237F; M238L; P242S; E244D; A251eE,Q; R256D; 
E260A,H; R264I; A265Q; Q270Y, A, L, G; S271D,N; K273D; Y275F; 
Y278T,H; P280A; T287A; Q288L,I,F; F292Y; V293A; ()302R,H; P304A; 
N336S; D337T,Q,S,G; M338I; M339V,I; G340P,A; G343A, S, F, I , L; 
25 D349P; P352K; D360R; E362P; Y364W, F; A365V,L,S; A366D,V,S; 
S367K,A; W368K; A369I,L; A373S; A374S; R375H; I376M; E383kQ; 
E387P; G404A; M409R; T411K; L412R; E417R; F421Y; D431E. 

This invention also provides a new phytase which has been 
derived from a strain of Cladorrhinum, viz. C. f oecundissimum. 
30 Accordingly, the invention also relates to a polypeptide having 
phytase acitivity and which comprises SEQ ID NO: 2 or the mature 
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part (amino acids nos 16-495) thereof; or a polypeptide being at 
least 70, more preferably 75, 80, 85, 90, 95% homologous 
thereto; homology meaning similarity, preferably identity, and 
being determined using the program GAP and the settings as 
5 defined hereinabove. And the invention relates to a DNA 
construct which encodes a polypeptide having phytase activity, 
said DNA construct comprising a DNA molecule which comprises 
SEQ ID NO:l or nucleotides nos. 20-70 and 207-1560 thereof; or 
nucleotides nos. 20-70 and 207-1563 thereof; or nucleotides nos. 

10 65-70 and 207-1560 thereof; or nucleotides nos. 65-70 and 207- 
1563 thereof; or a DNA construct or molecule which is at least 
70, 75, 80, 85, 90, 95 % homologous to either of these 
nucleotide sequences; homology meaning similarity, preferably 
identity, and being determined using computer programs known in 

15 the art such as GAP provided in the GCG program package (Program 
Manual for the Wisconsin Package, Version 8, August 1996, 
Genetics Computer Group, 575 Science Drive, Madison, Wisconsin, 
USA 53711) (Needleman, S.B. and Wunsch, CD., (1970), Journal 
of Molecular Biology, 48, 443-453) . Using GAP with the following 

20 settings for DNA sequence comparison: GAP creation penalty of 
5.0 and GAP extension penalty of 0.3. The invention also relates 
to a DNA construct which hybridizes with any of the above DNA 
sequences under the conditions mentioned hereinabove. 

25 EXAMPLES 
Example 1 

Phytase activity assay (FYT) 

Phytase activity can be measured using the following 

assay: 

30 10 pi diluted enzyme samples (diluted in 0.1 M sodium acetate, 
0.01 % Tween20, pH 5.5) are added into 250 \il 5 mM sodium 
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phytate (Sigma) in 0.1 M sodium acetate, 0.01 % Tween20, pH 5.5 
(pH adjusted after dissolving the sodium phytate; the substrate 
is preheated) and incubated for 30 minutes at 37 °C. The reaction 
is stopped by adding 250 ]il 10 % TCA and free phosphate is 
5 measured by adding 500 pi 7.3 g FeS04 in 100 ml molybdate 
reagent (2.5 g (NH 4 ) 6 Mo 7 0 24 . 4H 2 0 in 8 ml H 2 S0 4 diluted to 250 ml). 
The absorbance at 750 nm is measured on 200 ]il samples in 96 
well microtiter plates. Substrate and enzyme blanks are 
included. A phosphate standard curve is also included (0-2 mM 
10 phosphate) . 1 FYT equals the amount of enzyme that releases 1 
ymol phosphate/min at the given conditions. 

Example 2 

Test for specific activity 

15 The specific activity can be determined as follows: 

A highly purified sample of the phytase is used (the 

purity is checked beforehand on an SDS poly acryl amide gel 

showing the presence of only one component) . 

The protein concentration in the phytase sample is 
20 determined by amino acid analysis as follows: An aliquot of the 

phytase sample is hydrolyzed in 6N HC1, 0.1% phenol for 16 h at 

110 C in an evacuated glass tube. The resulting amino acids are 

quantified using an Applied Biosystems 420A amino acid analysis 

system operated according to the manufacturers instructions. 
25 From the amounts of the amino acids the total mass - and thus 

also the concentration - of protein in the hydrolyzed aliquot 

can be calculated. 

The activity is determined in the units of FYT. One FYT 

equals the amount of enzyme that liberates 1 micromol inorganic 
30 phosphate from phytate (5 mM phytate) per minute at pH 5.5 f 

37 °C; assay described e.g. in example 1. 
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The specific activity is the value of FYT/mg enzyme 
protein. 

Example 3 

5 Test for temperature and pH activity and stability 

Temperature and pH activity and stability can be 
determined as follows: 

Temperature profiles (i.e. temperature activity 
relationship) by running the FYT assay of Example 1 at various 
10 temperatures (preheating the substrate) . 

Temperature stability by pre-incubating the phytase in 0.1 
M sodium phosphate, pH 5.5 at various temperatures before 
measuring the residual activity. 

The pH-stability by incubating the enzyme at pH 3 (25 mM 
15 glycine-HCl) , pH 4-5 (25 mM sodium acetate), pH 6 (25 mM MES), 
pH 7-9 (25 mM Tris-HCl) for 1 hour at 40°C, before measuring the 
residual activity. 

The pH-profiles (i.e. pH activity relationship) by running 
the assay at the various pH using the same buffer-systems (50 
20 mM, pH re-adjusted when dissolving the substrate) . 

Example 4 

DSC as a preferred test for thermostability 

The thermostability or melting temperature, Tm, can be 
25 determined as follows: 

In DSC the heat consumed to keep a constant temperature 
increase in the sample-cell is measured relative to a reference 
cell. A constant heating rate is kept (e.g. 90°C/hour) . An endo- 
thermal process (heat consuming process - e.g. the unfolding of 
30 an enzyme/protein) is observed as an increase in the heat 
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transferred to the cell in order to keep the constant 
temperature increase . 

DSC can be performed using the MC2-apparatus from 
MicroCal. Cells are equilibrated 20 minutes at 20°C before 
5 scanning to 90°C at a scan rate of 90°/h. Samples of e.g. around 
2.5 mg/ml phytase in 0.1 M sodium acetate, pH 5.5 are loaded. 

Example 5 

Phytase variants of amended activity characteristics 

Variants of an Aspergillus fumigatus model phytase (a wild 
10 type phytase derived from strain ATCC 13073) were prepared as 
described in EP 98104858.0 (EP-A-0897010) , examples 2-3 and 5, 
and the phytase activity was determined as described in example 
7 thereof. pH- and temperature optimum and melting point was 
determined as described in examples 9 and 10 of EP 98113176.6 
15 (EP-A-0897985) . 

In Table 1, variants of improved specific activity at pH 
5.0 are listed. Table 2 lists variants of improved relative 
activity at pH 3.0, and Table 3 lists variants of improved 
thermostability (temperature optimum, e.g. determined by DSC) . 

20 



Table 1 



Amended in position 
no. 


Substitution into 


Specific activity at 
pH 5.0 (U/mg) 


43 


43L 


83. 4 


43N 


45.5 


43T 


106.9 


431 


91.2 


43V 


35. 0 


43A 


27. 3 


43G 


59. 6 
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43 and 270 


43L, 270L 


88.7 




4j ana 27 0 and 27 3 


43L, 270L, 273D 


92.3 


43 and 78 


43L, 78D 


118.5 


43 and 153 and 154 


43L. 153Y l S4r; 


193. 0 


A. fumigatus wild- 
type phytase 




26.5 




Table 2 

Amended in position 
no . 


Substitution into 


activity . 


phytase 
at pH 3.0 


205 


205E 


41% 


273 


273K 


61% 


278 


278H 


75% 


273 and 205 


273K, 205E 


65% 


273 and 278 


273K, 278H 


100% 


273 and 205 and 278 


273K 205E ?7flH 


96% 


A, fumigatus wild- 
tyP e phytase 




32% 




Table 3 

Amended in position 
no . 


Substitution into 


Tempera- 
ture 
optimum 

<°c> 


Tm (°C) 
(DSC) 


43 and 47 and 88 and 
102 and 220 and 242 
and 2 67 


43T, 47Y, 881, 102Y, 
220L, 242P, 267D 


60 


67 


as above plus 51 and 
302 and 337 and 373 
and 115 


as above plus 51A, 
302H, 337T, 373A, 
115N 


63 





WO 99/49022 PCT/DK99/00153 



63 


A. fumigatus wild- 




55 


62. 5 


type phytase 









Example 6 

Further phytase variants of amended activity characteristics 

Variants of the ascomycete consensus sequence "conphys" of 
Fig. 9 were prepared as described in EP 98113176.6 (EP-A- 
5 0897985), examples 4-8. Phytase activity, including pH- and 
temperature optimum, and melting point was determined as 
described in examples 9 and 10, respectively, thereof. 

The tables below list variants of amended activity 
characteristics, viz . 
10 Table 4 variants of improved specific activity at pH 6.0; 

Table 5 variants of amended pH optimum (the pH-optimum 
indicated is an approximate value, determined as that pH-value 
(selected from the group consisting of pH 4.0; 4.5; 5.0; 5.5; 
6.0; 6.5; and 7;0) at which the maximum phytase activity was 
15 obtained) ; 

Table 6 a variant of improved thermostability (expressed 
by way of the melting point as determined by differential 
scanning calorimetry (DSC) ) ; and 

JCable 7 variants of amended thermostability (temperature 
20 optimum) ; a or indicates a positive or a negative, 

respectively, effect on temperature optimum of up to 1°C; and a 
"++" and means a positive or a negative, respectively, 

effect on temperature optimum of between 1 and 3°C. 



25 Table 4 



Amende 


id in position 


Substitution into 


Specific activity at 


no. 






pH 6.0 (U/mg) 


43 


43T 


130 



WO 99/49022 



PCT/DK99/00153 





43L 


205 


Conphys 
Table 5 

Amended In position 
no. 


Substitution into 


62 

pH optimum 
around 


43 


4 3T 


6.0 


43L 


5.5 


43G 


6.5 


43 and 44 


43L, 44N 


6.0 


43T, 44N 


5.5 


Conphys 
Table 6 

Amended In position 
no. 


Substitution into 


6 . 0 

Tm (°C) 


43 


43T 


78 . 9 


Conphys 
Table 7 

Amended in position 
no. 


Substitution into 


78 1 

Temperature optimum 
amendment 


51 


A 


+ 


58 


K 


+ 


220 


N 


+ 


195 


L 


++ 


201e 


T 


++ 


244 


D 


+ 


2 64 


I 


+ 


302 


H 


+ 



» 
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337 


T 


++ 


352 


K 


+ 


373 


A 


++ 


47 


F 


■ 


62 


I 




83 


K 




90 


R 




143 


N 




148 


V 




186 


A 




187a 


S 




198 


V 




204 


A 




211 


V 




215 


P 




251e 


Q 




260 


A 




265 


A 




339 


V 


— 


365 


A 




383k 


E 




404 


G 




417 


R 




Conphys 




0 
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Table 8 



Amended in position 
no • 


Substitution 
into 


Tm (°C) (DSC) 


activity at 
pH 5.0 (U/mg) 


h j ana o± ana ^zu 
and 244 and 264 and 
302 and 337 and 352 
and 373 


3lA, ZZON, 
244D, 2641, 
302H, 337T, 
352K, 373A, 
43T 


84 . 7 


105 


as above plus 80 


as above plus 
80A 


85.7 


180 


Conphys 




78. 1 


30 



Example 7 

Cloning of a phytase of Cladorrhinum f oecundissimum 

DNA encoding a phytase from Cladorrhinum f oecundissimum 
CBS 427.97 has been cloned, and the enzyme isolated and 
purified, essentially as described in WO 98/28409. 

Fig. 15 shows the DNA sequence of the Hindlll/Xbal cloned 
PCR product in pA2phy8 . The cloned PCR product is amplified from 
the genomic region encoding Cladorrhinum f oecundissimum CBS 
427.97 phyA gene. The putative intron is indicated by double 
underline of the excision-ligation points in accordance with the 
GT-AG rule (R. Breathnach et al. Proc. Natl. Acad. Sci . USA 75 
(1978) pp4853-4857) . The restrictions sites used for cloning are 
underlined. 

According to the SignalP VI . 1 prediction (Henrik Nielsen, 
Jacob Engelbrecht, Stren Brunak and Gunnar von Heijne: 
5 "Identification of prokaryotic and eukaryotic signal peptides 
and prediction of their cleavage sites , " Protein Engineering 10, 
1-6 (1997)), the signal peptide part of the enzyme corresponds 
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to amino acids nos. 1-15, accordingly the mature enzyme is amino 
acids nos. 16-495. 

The enzyme exhibits a pH optimum around pH 6 with no ac- 
tivity at the low pH (pH 3), but significant activity up until 
5 pH 7.5; thus it is a more alkaline phytase as compared to the 
Aspergillus ficuum phytase. 

A temperature optimum around 60 °C was found at pH 5.5. 
Thus, this phytase is more thermostable than the A. ficuum phy- 
tase. 

10 

Example 8 

Alignment: of a new model phytase according to Fig. 1 

The phytase sequence of Cladorrhinum f oecundissimum as 
disclosed in Example 7 is compared with the 13 model phytases of 

15 Fig. 1 using GAP version 8 referred to above with a GAP weight 
of 3.000 and a GAP lengthweight of 0.100. Complete amino acid 
sequences are compared. The M_thermophila phytase sequence turns 
up to be the most homologous sequence, showing a degree of 
similarity to the C. f oecundissimum sequence of 70.86%. 

20 Still using the GAP program and the parameters mentioned 

above, the phytase sequence "C_f oecundissimum'' is now aligned to 
the "M-thermophila" phytase - see Fig. 16. The average match is 
0.540; , the average mismatch -0.396; quality 445.2; length 505; 
ratio 0.914; gaps 9; percent similarity 70.860; percent identity 

25 53.878. 

In a next step, see Fig. 17, the C_f oecundissimum is 
pasted (or it could simply be written) onto the alignment of 
Fig. 1 as the bottom row, ensuring that those amino acid 
residues which according to the alignment at Fig. 16 are 
30 identical (indicated by a vertical line) or similar (indicated 
by one or two dots) are placed above each other. At 5 places 
along the sequence, the C_f oecundissimum sequence comprises 
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"excess" amino acid residues, which the alignment of Fig. 1 does 
not make room for. At Fig. 17, these excess residues are 
transferred onto a next row (but they can be included in the 
multiple alignment and numbered as described previously in the 
5 position numbering related paragraphs (using the denotations a, 
b, c etc. ) . 

Corresponding variants of the phytase of C__f oecundissimum 
are then easily deduced on the basis of Fig. 17. Some examples: 
The variants generally designated "80K,A" and M3T" in 
10 C_foecundissimum correspond to "K80A" and "Q43T," respectively. 
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1. A phytase variant which, when aligned according to Fig. 1, 
is amended as compared to a model phytase in at least one of the 
following positions, using the position numbering corresponding 

5 to P_lycii: 

24; 27; 31; 33; 39; 40; 41; 42; 43; 44; 45; 46; 47; 49; 51; 56; 

58; 59; 61; 62; 68; 69; 70; 71; 72; 73; 74; 75; 76; 77; 78; 79; 

80; 81; 82; 83; 84; 88; 90; 102; 115; 116; 117; 118; 119; 120; 

121; 122; 123; 124; 125; 126; 127; 128; 132; 143; 148; 149; 150; 
10 151; 152; 153; 154; 155; 156; 157; 158; 159; 160; 161; 162; 163; 

170f; 170g; 171; 172; 173; 184; 185; 186; 187; 187a; 190; 191; 

192; 193; 194; 195; 198; 199; 200; 201; 201a; 201b; 201c; 201d; 

201e; 201f; 202; 203; 203a; 204; 205; 211; 215; 220; 223; 228; 

232; 233; 234; 235; 236; 237; 238; 239; 242; 243; 244; 246; 
15 251e; 253; 256; 260; 264; 265; 267; 270; 271; 272; 273; 274; 

275; 276; 277; 278; 279; 280; 283; 285; 287; 288; 292; 293; 302; 

304; 332; 333; 334; 335; 336; 337; 338; 339; 340; 341; 342; 343; 

348; 349; 352; 360; 362; 364; 365; 366; 367; 368; 369; 370; 371; 

372; 373; 374; 375; 376; 383k; 387; 393; 394; 396; 404; 409; 
20 411; 412; 413; 417; 421; 431. 

2. A phytase variant which, when aligned according to Fig. 1, 
comprises at least one of the following amendments as compared 
to a model phytase, using the position numbering corresponding 

25 to the phytase of P_lycii: 

24C; 27P; 31Y; 33C; 39H,S,Q; 40L,N; 42S,G; 

43A,C,D,E,F,G,H,I,K,L,M,N,P,Q,R,S,T,V,W,Y; 44N; 45D,S; 47Y,F; 
49P; 51E,A,R; 56P; 58D,K,A; 59G; 61R; 62V, I; 69Q; 75W,F; 78D,S; 
79G; 80K,A; 81A,G,Q,E; 82T; 83A,I,K,R,Q; 84I,Y,Q,V; 881; 90R,A; 
30 102Y; 115N; 116S; 118V, L; 119E; 120L; 122A; 123N,Q,T; 125M,S; 
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126H,S,V; 127Q,E,N; 128A,S,T; 132F,I,L; 143N; 148V, I; 151A,S; 

152G; 153D,Y; 1540,0,3,6; 157V; 158D,A; 159T; 160A,S; 161T,N; 

162N; 163W; 170fH; 170gA; 171N; 172P; 173Q,S; 184Q,S,P; 185S; 

186A,E,P; 187A; 187aS; 190A,P; 193S; 194S,T; 195T,V,L; 198A,N,V; 
5 200G,V; 201D,E; 201a(); 201b(); 201c(); 201d(); 201e(); 201f(); 

201eT; 202S,A; 203R,K,S; 203aV,T; 204Q, E, S, A, V; 205E; 211L,V; 

215A,P; 220L,N; 223H,D; 228N; 232T; 233E; 235Y,L,T; 236Y,N; 

237F; 238L,M; 242P,S; 244D; 246V; 251eE,Q; 253P; 256D; 260A,H; 

264R,I; 265A,Q; 267D; 270Y,A,L,G; 271D,N; 273D,K; 275F,Y; 
10 278T,H; 280A, P; 283P; 287A,T; 288L,I,F; 292F,Y; 293A,V; 302R,H; 

304P,A; 332F; 336S; 337T,G,Q,S; 3381; 339V, I; 340P,A; 

343A,S,F,I,L; 348Y; 349P; 352K; 360R; 362P; 364W, F; 365V,L,A,S; 

366D,S,V; 367A,K; 368K; 3691, L; 370V; 373A,S; 374S,A; 375H; 

376M; 383kQ,E; 387P; 393V; 396R; 404A,G; 409R; 411K,T; 412R; 
15 417E,R; 421F,Y; 431E. 

3. The phytase variant of any of claims 1 or 2, which is 
derived from an ascomycete phytase. 

20 4. The phytase variant of claim 3 which is derived from an 
Aspergillus phytase . 

5. The phytase variant of claim 4, wherein the model phytase 
is a strain of Aspergillus niger, Aspergillus ficuum, 

25 Aspergillus nidulans, Aspergillus fumigatus, Aspergillus 
terreus . 

6. The phytase variant of claim 5 wherein the model phytase 
is Aspergillus nidulans DSM 9743; or any of the following 

30 strains of Aspergillus terreus: CBS 116.46, DSM 9076, CBS 
220. 95. 
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7. The phytase variant of claim 6 wherein the model phytase 
is the Aspergillus nidulans phytase sequence shown in Fig. 10; 
or the Aspergillus terreus phytase sequence shown in Fig. 12. 

5 

8. The phytase variant of claim 3 wherein the model phytase 
is a strain of Thermomyces lanuginosus, Talaromyces 
thermophilus, or Myceliophthora thermophila. 

10 9. The phytase variant of claim 8 wherein the model phytase 
is Thermomyces lanuginosus CBS 586.94; or any of the following 
strains of Talaromyces thermophilus: ATCC 20186, ATCC 74338; or 
any of the following strains of Myceliophthora thermophila: ATCC 
34625, ATCC 74340. 

15 

10. The phytase variant of claim 9 wherein the model phytase 
is the Thermomyces lanuginosus phytase sequence shown in Fig. 14; 
or the Talaromyces thermophilus sequence shown in Fig. 13; or the 
Myceliophthora thermophila phytase sequence shown in Fig. 7. 

20 

11. The phytase variant of claim 3 wherein the model phytase 
is an ascomycete consensus phytase sequence. 

12. The phytase variant of any of claims 1 or 2, which is 
25 derived from a basidiomycete phytase. 



13. The phytase variant of claim 12, wherein the model phytase 
is a strain of Paxillus involutus, Trametes pubescens, Agrocybe 
pediades, or Peniophora lycii. 
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14. The phytase variant of claim 13 wherein the model phytase 
is Trametes pubescens CBS 100232 or Paxillus involutus CBS 
100231. 



5 15. The phytase variant of claim 14 wherein the model phytase 
is the Trametes pubescens phytase sequence of Fig. 4 or either 
of the Paxillus involutus phytase sequences of Figs. 2 and 3. 

16. The phytase variant according to any of claims 1 or 2, 
10 which comprises at least one of the following amendments: 

R24C; V27P; H39Q,S; L40N; G42S; 

Q4 3A,C, D,E, F,G, H, I,K,L,M,N, P, R, S, T, V, W, Y; Y44N; A45D,S; F47Y; 

S49P; A51E,R; V56P; A58D,K; V62I; S69Q; Y75W, F; D78S; S79G; 

K80A; G81A,Q,E; K82T; K83A,I,R,Q; Y84Q,I,V; E90R,A; D115N; 
15 D116S; T118V,L; P119E; F120L; E122A; Q123N,T; L125S,M ; V126H,S; 

N127Q,E; S128A,T; F132I,L; I148V; S151A; S153D,Y; S154Q,D,G; 

I157V; A158D; S159T: G160A,S; K161T,N; K162N; F163W; R170fH; 

Q171N; G173Q,S; S184P,Q; E185S; A186E,P; S187A; T190P,A; P193S; 

G194S,T; T195V,L; V198A,N; E200G,V; D201E; S201d(); E201e(),T; 
20 L201f(); preferably all three deletions; A202S; D203R,K,S; 

D203aV,T; V204Q,E,S,A; T211L / V; S215AP; L220N; D223H; T228N; 

T235Y,L; Y236N; L237F; M238L; S242P; I246V; K251eE,Q; H260A; 

I264R; N265Q,A; Q270Y,A,L,G; S271D,N; K273D; Y275F; H278T; 

A280P; T287A; Q288L,I,F; Y292F; A293V; H302R; P304A; N336S; 
25 G337S,T,Q; I339V; S340P,A; F343A, S , F, I , L; N349P; N360R; T362P; 

F364W; S365V,L,A; S366D,V; A367K; W368K; T369I,L; A373S: S374A; 

R375H; L376M; Q383kE; P404A,G; T411K; R417E; F421Y; A431E. 

17. The phytase variant of claim 16, the model phytase of 
30 which is an Aspergillus derived phytase, preferably derived from 

Aspergillus ficuum or Aspergillus niger. 
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18. The phytase variant of claim 17, the model phytase of 
which is a phytase derived from either of Aspergillus ficuum 
(niger) NRRL 3135, Aspergillus niger ATCC 9142, or Aspergillus 

5 niger ATCC 74337. 

19. The phytase variant of claim 18, the model phytase of 
which is the Aspergillus ficuum phytase sequence of Fig. 11. 

10 20. The phytase variant according to any of claims 1 or 2, 
which phytase variant comprises at least one of the following 
amendments : 

A24C; V27P; H39,S,Q; L40N; G42S; Q43C,D,E, F,H,K,M, P,R, S,W,Y; 

Y44N; S45D; F47Y; S49P; E51A, R; L56P; K58D,A; D59G; I62V; S69Q; 
15 Y75W,F; S78D; S79G; K80A; S81A,G,Q,E; K82T; K83A,I,Q,R; 

Y84Q,V, I; V88K; A90R; F102Y; D115N; D116S; T118V,L; P119E; 

F120L; E122A; Q123N,T; L125S,M; V126H,S; N127Q,E; S128A,T; 

F132,I,L; S143N; I148V; S151A; S153D,Y; D154Q,S,G; I157V; A158D; 

S159T; G160A,S; E161T, N; K162N; F163W; G170fH; ()171N; N173Q,S; 
20 T172P; P184Q,S; E185S; S186A,E,P; E187A; T187aS; T190P,A; 

G194S,T; V195L,T; K198A,N,V; E200G,V; A201D,E; S201d(); 

Q201e(),T; L201f(); preferably all three deletions; G202S,A; 

D203R,K,S; E203aV,T; V204Q,E,S f A; A205E; L211V; A220L,N; H223D; 

T228N; E232T; D233E; V235Y,L,T; V236Y,N; L237F; M238L; C242P,S; 
25 T246V; Q251eE,Q; Q256D; H260A; K264R,I; K265Q,A; N267D; 

Q270Y,A,L,G; S271D,N; G273D,K; Y275F; Y278T,H; A280P; A287T; 

Q288L,I / F; F292Y; T293A,V; R302H; P304A; F332F; N336S; 

S337T,G f Q; M338I; V339I; S340P f A; F343A,S,I / L; N349P; E352K; 

S360R; K362P; Y364W f F; S365V,L,A; A366D,V f S; S367A,K; W368K; 
30 V369I,L; G373S,A; R375H; A376M; K383kQ,E; D404A,G; K411T; I393V; 

L412R; K417E,R; W421F,Y; G431E. 
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21. The phytase variant of claim 20, which is derived from an 
Aspergillus phytase, preferably using a model phytase derived 
from Aspergillus fumigatus. 

5 

22. The phytase variant of claim 21, the model phytase of 
which is a phytase derived from either of the following strains 
of Aspergillus fumigatus: ATCC 13073, ATCC 32722, ATCC 58128, 
ATCC 26906 or ATCC 32239. 

10 

23. The phytase variant of claim 22, the model phytase of 
which is the Aspergillus fumigatus phytase sequence of Fig. 8. 

24. The phytase variant according to any of claims 1 or 2, 
15 which phytase variant comprises at least one of the following 

amendments : 

G24C; V27P; H39S,Q; L40N; G42S; 

Q43A,C, D,E, F,G,H, I,K,L,M,N, P, Q, R, S, T, V, W, Y; Y4 4N; S4 5D; Y47F; 

S49P; E51A,R; V56P; D58K,A; D59G; V62I; S69Q; Y75W, F; S78D; 
20 S79G; K80A; S81A,G,Q,E; K82T; A83I,Q,K,R; Y84,Q,I,V; A90R; 

D115N; D116S; T118V,L; F119E; P120L; E122A; N123Q,T; M125S; 

V126H,S; N127Q,E; S128A,T; Y132F,I,L; K143N; I148V; S151A; 

S153D,Y; D154Q,S,G; I157V; A158D; S159T; A160S; E161T,N; K162N; 

F163W; G170fH; S170gA; Q171N; H173Q,S; P184Q,S; E185S; 
25 G186A,E,P; S187A; G187aS; T190P,A; H193S; G194S,T; T195V,L; 

A198N,V; E200G,V; D201E; S201d(); E201e(),T; L201f(); preferably 

all three; G202S,A; D203R,K,S; D203aV,T; V204Q,S,A,E; L211V; 

A215P; L220N; D223H, T228N; E232T; D233E; V235Y,L,T; Y236N; 

L237F; M238L; P242S; E244D; E251e,Q; A256D; H260A; R264I; Q265A; 
30 Q270Y,A,L,G; S271D,N; G273D,K; Y275F; Y278T,H; A280P; A287T; 

Q288L,I,F; F292Y; A293V; R302H; P304A; N336S; S337T,Q,G; M338I; 
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I339V; S340P,A; F343A, S, I , L; N349P; A352K; S360R; E362P; 
Y364W, F; S365V,L,A; A366D,V,S; S367K,A; W368K; T369I,L; G373S,A; 
A374S; R375H; A376M; Q383kE; A404G; K411T; E417R; F421Y; A431E. 

5 25. The phytase variant of claim 24, the model phytase of 
which is an ascomycete consensus phytase. 

26. The phytase variant of claim 25, the model phytase of 
which is the ascomycetes consensus sequence "conphys" of Fig. 9. 

10 

27. The phytase variant according to any of claims 1 or 2, 
which phytase variant comprises at least one of the following 
amendments : 

V24C; F27P; ()31Y; F33C; D39H,S,Q; S40L,N; A42S,G; 
15 A43C,D,E, F,G,H,I,K,L,M,N, P, Q, R, S, T, V, W, Y; Y44N; T45D,S; Y47F; 

Q51E,A,R; K58D,A; K61R; I62V; F75W; S78D; A80K; G81A,Q,E; 

R83A, I,Q,K; I84Y,Q,V; V88I; K90R,A; L102Y; D115N; D116S; V118L; 

P119E; F120L; L123N,T,Q; S125M; S126H,V; Q127E,N; A128S,T; 

T132F, I,L; E143N; V148I; S151A; S152G; S153D,Y; N154D, Q, S, G; 
20 D158A; S159T; A160S; T161N; ()170fH; ()170gA; ()171N; H173Q,S; 

H172P; S184Q, P; E185S; S186A,E,P; L187A; ()187aS; T190P,A; 

D193S; A194S,T; M195T,V,L; N198A,V; G200V; S201D,E ()201eT; 

S202A; D203R, K,S; P203aV,T; Q204E,S,A,V; T205E; I211L,V; P215A; 

L220N; Q223D,H; A232T; D233E; S235Y,L,T; N236Y; L237F; I238L,M; 
25 A242P,S; E244D; I246V; ()251eE,Q; N256D; P260A,H; A264R, I; 

Q265A; E267D; G270Y,A,L; L332F; D271N; D273K; F275Y; T278H; 

Y280A, P; Y283P; V287A,T; Q288L,I,F; Y292F; I293A,V; E302R,H; 

P304A; L332F; N336S; Q337T,S,G; M338I; I339V; A340P; 

S343A, F, I , L; F348Y; N349P; S352K; P360R; R362P; W364F; 
30 V365L,A,S; T366D,V,S; S367K,A; R368K; L369I; T370V; S373A; 
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A374S; R375H; S383kQ,E; T387P; A396R; G404A; L409R; T411K; 
L412R; E417R; Y421F. 



28. The phytase variant of claim 27, the model phytase of 
5 which is a phytase derived from Agrocybe pediades, 



29. The phytase variant of claim 27, the model phytase of 
which is a phytase derived from Agrocybe pediades CBS 900.96. 

10 30. The phytase variant of claim 29, the model phytase of 
which is the Agrocybe pediades phytase sequence of Fig. 5. 

31. The phytase variant according to any of claims 1-2, which 
phytase variant comprises at least one of the following 
15 amendments: 

F24C; V27P; L31Y; I33C; S39H,Q; N40L; G42S; 

P43A,C,D,E,F,G,H,I,K,L,M,N,Q,R,S,T,V,W,Y; Y44N; D45S; F47Y; 

E51A,R; E58D,K,A; T61R; V62I; W75F; S78D; A80K; R81Q,E,G,A; 

S82T; R83A, I , Q, K; Q84Y,V,I; V88I; K90R,A; A115N; D116S; L118V; 
20 P119E; F120L; N123T,Q; S125M; H126S,V; Q127E,N; T128A,S; 

M132F, I,L; G143N; V148I; A151S; D153Y; Q154D,S,G; D158A; S159T; 

S160A; T161N; ()170fH; ()170gA; S171NG172P; E173Q,S; Q184S,P; 

E185S; E186A, P; G187A; ()187aS; T190P,A; N193S; N194S,T; 

M195T,V,L; N198A,V; V200G; D201E; ()201eT; G202S,A; D203R,K,S; 
25 ()203aV,T; E204Q,S,A,V; S205E; V211L; N215A,P; L220N; A223D,H; 

S232T; D233E; L235Y,T; T236Y,N; L237F; M238L; P242S; L246V; 

()251eE,Q; A260H; V264R,I; S265Q,A; E267D; Y270A,L,G; D271N; 

D273K; F275Y; G278T,H; P280A; A283P; T287A; Q288L,I,F; Y292F; 

V293A; G302R,H; A304P; N336S; T337Q,S,G; M338I; V339I; P340A; 
30 A343S,F, I,L; F348Y; N349P; A352K; E360R; R362P; W364F; 
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V365L,A,S; D366V,S; S367K,A; L369I; S373A; G374A,S; ()383kQ,E; 
E387P; A396R; G404A; V409R; E411K,T; L412R; E417R; Y421F; A431E. 

32. The phytase variant of claim 31, the model phytase of 
5 which is a phytase derived from Peniophora lycii. 

33. The phytase variant of claim 32, the model phytase of 
which is a phytase derived from Peniophora lycii CBS 68 6.96. 

10 34. The phytase variant of claim 33, the model phytase of 
which is the Peniophora lycii phytase sequence of Fig. 6. 

35. A phytase polypeptide which comprises a phytase variant 
according to any of the previous claims. 

15 

36. A DNA construct comprising a DNA sequence encoding a 
phytase variant according to any one of claims 1-34. 

37. A recombinant expression vector which comprises a DNA 
20 construct according to claim 36. 

38. A host cell which is transformed with a DNA construct 
according to claim 36 or a vector according to claim 37. 

25 39. A process for preparing a phytase variant, the process 
comprising culturing the host cell according to claim 38 under 
conditions permitting the production of the phytase variant, and 
recovering the phytase from the culture broth. 

30 40. A feed or food comprising at least one phytase variant of 
any of claims 1-34. 
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41. A process for preparing a feed or food according to claim 
40, wherein the at least one phytase variant is added to the 
food or feed components. 

5 

42. A composition comprising at least one phytase variant of 
any of claims 1-34. 

43. The composition according to claim 42 suitable for use in 
10 food or feed preparations. 

44. The composition according to any of claims 42-43 which is 
an animal feed additive. 

15 45. A process for reducing phytate levels in animal manure 
comprising feeding an animal with an effective amount of the 
feed according to claim 40 or obtainable according to claim 41. 

46. Use of the phytase variant of any of claims 1-34; or the 
20 composition of any of claims 42-43 for liberating phosphorous 

from a phytase substrate. 

47. A transgenic plant or plant part which is capable of 
expressing a phytase variant according to any one of claims 1- 

25 34 . 
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15 



Peniophora numbers l 

Alignment numbers 1 

P_involtus_Al ML FGFVALACLL 

P_involtus_A2 MH LGFVTLACLI 

T_pubescens MAFSILASLL 

A_ pediades MSLFIGGCLL 

PJLycii MV SSAFAPSILL 

A_fumigatus MVTL TFLLSAAYLL 

consphyA MGVF WLLSIATLF 

A__nidulans MAFF TVALSLYYLL 

A_ficuumJtJRRL3135 MGVS AVLLPLYLLS 

A_terreus M6FL AXVLSVALLF 

T_thermo MSLL LLVLSGGLVA 

T_lanuginosa MAGIGLGSFL VLLLQFSALL 

M_thermophila MTGL GVMWMVGFL 



37 
50 

SLSEVLATSV P KNT APTFPIPESE 

HLSEVFAASV P RNI APKFSIPESE 

FVCYAYARAV PRAHIPLRDT SACLDVTRDV 

VFLQASAYGG WQATFVQPF FPPQI 

SLMSSLALST QFSF V AAQLP IPAQN 

.SGRVSAAPS SAGSKSCDTV DLGYQCS PAT 
GSTSGTALGP RGNSHSCDTV DGGYQCFPEI 
. . SRVSAQAP WQNHSCNTA DGGYQCFPNV 
GVTSGLAVPA SRNQSSCDTV DQGYQCFSET 
RSTSGTPLGP RGKHSDCNSV DHGYQCFPEL 

LYVS RNP HVDSHSCNTV EGGYQCRPE I 

TASPAIPPFW RKKHPNVD j 

AIASL QSESRPCDTP DLGFQCGTAI 



25 



2 0 P_involtus_Al 
P_involtus_A2 
T_pubescens 
A_pediades 
P_lycii 
A_fumigatus 
consphyA 
A_nidulans 
A__f i cuum_NRRL3 13 5 
A_terreus 
30 T_thermo 
T_lanuginosa 
M_t hermophi 1 a 



38 

51 

QRNWSPYSPY 
QRNWSPYSPY 
QQSWSMYSPY 
QDSWAAYTPY 
TSNWGPYDPF 
SHLWGQYSPF 
SHLWGQYSPY 
SHVWGQYSPY 
SHLWGQYAPF 
SHKWGLYAPY 
SHSWGQYSPF 
ARHWGQYS PF 
SHFWGQYSPY 



FPLAEYKA. 



FPAATYVA. . 
YPVQAYTP . . 
FPVEPYAA. . 
FSLEDELSVS 
FSTiKDESAIS 
FSIEQESAIS 
FSLANESVIS 
FSIiQDESPFP 
FSLADQSEXS 
FSLAEVSEIS 
FSVP. .SELD 



. . PPAGCQIN 

- . PPAGCEIN 

. . PPASCQIN 

. . PPKDCKIT 

. . PPEGCTVT 

SKLPKDCRIT 

PDVPDDCRVT 

EDVPHGCEVT 

PEVPAGCRVT 

LDVPEDCHIT 

PDVPQNCKIT 

PAVPKGCRVE 

ASIPDDCEVT 



QVNIIQRHGA 
QVNIIQRHGA 
QVHIIQRHGA 
QVNIIQRHGA 
QVNLIQRHGA 
LVQVLSRHGA 
FVQVLSRHGA 
FVQVLSRHGA 
FAQVLSRHGA 
FVQVLARHGA 
FVQLIiSRHGA 
FVQVLSRHGA 
FAQVLSRHGA 



83 
100 

RFPTSGATTR 
RFPTSGAATR 
RFPTSGAAKR 
RFPTSGAGTR 
RWPTSGARSR 
RYPTSSKSKK 
RYPTSSKSKA 
RYPTESKSKA 
RYPTDS KGKK 
RSPTHSKTKA 
RYPTSSKTEL 
RYPTAHKSEV 
RAPTLKRAAS 



40 



35 

P_involtus_Al 
P_i nvo 1 tus_A2 
T_pubescens 
A_ped±ades 
P_lycii 
A_fumigatus 
consphyA 
A_nidulans 
A_f icuum_NRRL3 13 5 
45 A_terreus 

T_thermo 
T_lanuginosa 
M_thermophi 1 a 



50 



84 
101 

IKAGLTKLQG 

IKAGLSKLQS 

IQTAVAKLKA 

IQAAVKKLQS 

QVAAVAKIQM 

YKKLVTAIQA 

YSALIEAIQK 

YSGLIEAIQK 

YSALIEEIQQ 

YAATIAAIQK 

YSQLISRIQK 

YAELLQRIQD 

YVDLIDRIHH 



VQNFTDAKFN 
VQNFTDPKFD 
ASNYTDPLLA 
AKTYTDPRLD 
ARPFTDPKYE 
NATDFKGKFA 
NATAFKGKYA 
NATS FWGQ YA 
NATTFDGKYA 
SATAFPGKYA 
TATAYKGYYA 
TATEFKGDFA 
GAISYGPGYE 



FIKSFKYDLG 
FIKSFTYDLG 
FVTNYTYSLG 
FLTNYTYTLG 
FLNDFVYKFG 
FLKTYNYTLG 
FLKTYNYTLG 
FLESYNYTLG 
FLKTYNYSLG 
FLQSYNYSLD 
FLKDYRYQLG 
FLRDYAYHLG 
FLRTYDYTLG 



NSDLVPFGAA 

TSDLVPFGAA 

QDSLVELGAT 

HDDLVPFGAL 

VADLLPFGAN 

ADDLTPFGEQ 

ADDLTPFGEN 

ADDLTIFGEN 

ADDLTPFGEQ 

SEELTPFGRN 

ANDLTPFGEN 

ADNLTRFGEE 

ADELTRTGQQ 



133 
150 

QSFDAGQEAF 
QSFDAGLEVF 
QSSEAGQEAF 
QSSQAGEETF 
QSHQTGTDMY 
QLVNSGIKFY 
QMVNSGIKFY 
QMVDSGAKFY 
ELVNSGIKFY 
QLRDLGAQFY 
QMIQLGIKFY 
QMMESGRQFY 
QMVNSGIKFY 



P_involtus_Al 
P involtus A2 



134 176 
151- 200 

ARYSKLVSKN NLPFIRADGS DRWDSATNW TAGFASA SHNTVQ 

ARYSKLVSSD NLPFIRSDGS DRWDTATNW TAGFASA SRNAIQ 



Fig. 1A 
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T_j?ubescens 
A_pediades 
P_lycii 
A_fumigatus 
consphyA 
A_nidulans 
A_f icuum_NRRL3 135 
A_terreus 
T_thermo 
T_lanuginosa 
M_thermophila 



TRYS SLVSAD 
QRYS FLVSKE 
TRYSTIiFEGG 
QRYKAL.ARS 
RRYKAL.ARK 
RRYKNL.ARK 
QRYESL.TRN 
ERYNAL .TRH 
NHYKSL . ARN 
HRYREQ .ARE 
RRYRAL .ARK 



ELPFVRASGS 
NLPFVRASSS 
DVPFVRAAGD 
WPFIRASGS 
IVPFIRASGS 
NTPFIRASGS 
IVPFIRSSGS 
INPFVRATDA 
AVPFVRCSGS 
IVPFVRAAGS 
S I PFVRTAGQ 



DRWATANNW 
NRWDSATNW 
QRWDSSTNW 
DRVIASGEKF 
DRVXASAEKF 
DRWASAEKF 
SRVTASGKKF 
SRVHESAEKF 
DRVIASGRIiF 
ARVIASAEFF 
DRWHSAENF 



TAGFALA. . . 

TEGFSAA. . . 

TAGFGDA. . . 

IEGFQQAKLA 

IEGFQSAKLA 

INGFRKAQLH 

IEGFQSTKLK 

VEGFQTARQD 

IEGFQSAKVL 

NRGFQDAKDR 

TQGFHSALLA 



. . . .SSNSIT 
.... SHHVLN 
.... SGETVL 
DPGA.TNRAA 
DPGSQPHQAS 
DHGS . . KRAT 
DPRAQPGQSS 
DHHANPHQPS 
DPHSDKHDAP 
DPRSNKDQAE 
DRGSTVRPTL 



15 



20 



25 



30 



P_invo 1 tus_Al 
P_invol tus_A2 
T_pubescens 
A_jpediades 
P_lycii 
A_f umigatus 
consphyA 
A_nidulans 
A_f i CUum_NRRL 3 13 5 
A_terreus 
T_thermo 
T_lanuginosa 
M_thermophila 



35 



P_involtus_Al 
P_involtua_A2 
Tjpubescens 
A_pediades 
P_lycii 
A^f umigatus 
consphyA 
A_nidulans 
A_f icuum_NRRL3 135 
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40 



177 
201 

PKLNLILPQT 
PKLDLILPQT 
PVLSVIISEA 
PILFVILSES 
PTLQWIiQEE 
PAISVTIPES 
PVIDVIIPEG 
PWNVIIPEI 
PKIDWISEA 
PRVDVAIPEG 
PTINVIIEEG 
PVINVTISEE 
PYDMWI PET 

218 
251 

ARLNAAAPSV 
AQLNAAAPGA 
ARLNAGAPGA 
NRLNQQAPGA 
ARLNAAAPSA 
ARAEKHLPGV 
ARLEADLPGV 
KRLENDLPGI 
QRLENDLSGV 
QRLEADLPGV 
EKIKDHLPGV 
KKITKHMPGV 
ARVNANLPGA 



G. -NDTLEDN 
G. .NDTLEDN 
G. .NDTLDDN 
Ii. .NDTLDDA 
G. -NCTLCNN 
ETFNNTLDHG 
SGYNNTLDHG 
DGFNNTLDHS 
S S SNNTLDPG 
SAYNNTLEHS 
PSYNNTLDTG 
TGSNNTLDGL 
AGANNTLHND 



MCPAAGD. . . 
MCPAAGE. . . 
MCPAAGD . . . 
MCPNAGS. . . 
MCPNEVD. . . 
VCTKFEA. . . 
TCTAFED. . . 
T CVS FEN. . . 
TCTVFED. . . 
LCTAFES. . . 
SCPVFED. . . 
TCPAAEE. . . 
LCTAFEEGPY 



. . SDPQVNA 
. . SDPQVDA 
. . SDPQVNQ 
. . SDPQTGI 
. .GD.ESTT 
SQLGDEVAAN 
SELGDDVEAN 
DERADEIEAN 
SELADTVEAN 
STVGDDAVAN 
SSGGHDAQEK 
AP.DPTQPAE 
STIGDDAQDT 



217 
250 

WLAVAFPSIT 
WLASAFPSVT 
WLAQFAPPMT 
WTSIYGTPIA 
WLGVFAPNIT 
FTALFAPDIR 
FTALFAPAIR 
FTAIMGPPIR 
FTATFVPSIR 
FTAVFAPAIA 
FAKQFAPAIIj 
FLQVFGPRVL 
YLSTFAGPIT 



NLTDTDAFNL 
NLTDADAFNL 
NLTDTDTYNL 
NITAADVSNIi 
NLSDSDALTL 
TLTDEDWSL 
TLTDEDWYL 
KLTNENVIYL 
TLTDTEVTYL 
QLSTDDWNIi 
DLAVSDVPYL 
NLTLEDVPLF 
NLTDADTVAL 



VSLCAFLTVS 

VSLCPFMTVS 

LTLCPFETVA 

IPLCAFETIV 

MDMCPFDTLS 

MDMCS FDTVA 

MDMCPFETVA 

MDMCSFDTMA 

MDMCS FDTXS 

MAMCPFETVS 

MDLCPFETLA 

MDLCPFDTVG 

MDLCPFETVA 



252 
300 

KEKK S 

KEQK S 

TERR S 

KETP S 

SGNA S 

RTSD. .ASQ LS 

RTSD. .ATE LS 

RTAH. -GTE LS 

TSTV. .DTK LS 

LTDD. .AHT. LS 

RNHT . • DT • • LS 

SDPVLFPRQ LS 

SSSSDPATAD AGGGNGRPLS 



50 



55 



P_involtus_Al 
P_involtus_A2 
T_pubescens 
A_pediades 
P_lycii 
A_fumigatus 
consphyA 
A nidulans 



253 
301 

DFCTLFEGIP 
DFCTLFEGIP 
EFCDIYEELQ 
PFCNLFT. .P 
PFCDLFT. .A 
PFCQLFT . . H 
PFCALFT. .H 
PFCAIFT. .E 



GSFEAFAYGG 
GSFEAFAYAG 
AE.DAFAYNA 
EEFAQFEYFG 
EEYVSYEYYY 
NEWKKYNYLQ 
DEWRQYDYLQ 
KEWLQYDYLQ 



DLDKFYGTGY 
DLDKFYGTGY 
DLDKFYGTGY 
DLDKFYGTGY 
DLDKYYGTGP 
SLGKYYGYGA 
SLGKYYGYGA 
SLSKYYGYGA 



GQELGPVQGV 
GQALGPVQGV 
GQPLGPVQGV 
GQPLGPVQGV 
GNALGPVQGV 
GNPLGPAQGI 
GNPLGPAQGV 
GSPLGPAQGI 



300 
350 

GYVNELIARL 
GYINELLARL 
GYINELIARL 
GYINELLARL 
GYVNELLARL 
GFTNELIARL 
GFANELIARL 
GFTNELIARL 
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A_f icuum_NRRL3 13 5 
A_terreus 
T_thermo 
T_lanuginosa 
M_the rmophi 1 a 



10 



15 



20 



P_involtus_Al 
P_involtus_A2 
T_jpubescens 
Ajpediades 
P_lycii 
A_fumigattis 
consphyA 
A_nidulans 
A_f i cuum_NRRL3 135 
A_terreus 
T_thermo 
T_ lanuginosa 
M_thermophila 



PFCDLFT. .H DEWINYDYLQ 
PFCDLFT. .A TEWTQYNYLL 
PFCALST. .Q EEWQAYDYYQ 
PFCKLFT. .A DDWMAYDYYY 
PFCRLFS . • E SEWRAYDYLQ 



SLKKYYGHGA 
SLDKYYGYGG 
SLGKYYGNGG 
TLDKYYSHGG 
SVGKWYGYGP 



GNPLGPTQGV 
GNPLGPVQGV 
GNPLGPAQGV 
GSAFGPSRGV 
GNPLGPTQGV 



301 
351 

TUS . AVRDNT 
TOS.AVNDNT 
TAQ.NVSDHT 
TEM.PVRDNT 
TGQ.AVRDET 
TRS . PVQDHT 
TRS . PVQDHT 
TQS . PVQDNT 
TBS . PVHDDT 
TRA.PVHDHT 
THS . PVQDYT 
TGNLPVKDHT 
A.GVPVRDGT 



QTNRTLDASP 
QTNRTLDAAP 
QTNSTLDSSP 
QTMRTLDSSP 
QTNRTLDSDP 
STNSTLVSNP 
STNHTLDSNP 
STNHTLDSNP 
SSNHTLDSSP 
CVNNTLDASP 
TVNHTLDSNP 
TVNHTLDDNP 
STNRTIiDGDP 



VTFPLNKTFY 
DTFPLNXTMY 
ETFPLNRTLY 
LTFPLDRSIY 
ATFPLNRTFY 
ATFPLNATMY 
ATFPLNATLY 
ATFPLDRKLY 
ATFPLKSTLY 
ATFPLNATLY 
ATFPLNATLY 
ETFPLDAVLY 
RTFPLGRPLY 



ADFSHDNLMV 

ADFSHDNLMV 

ADFSHDNQMV 

ADLSHDNQMI 

ADFSHDNTMV 

VDFSHDNSMV 

ADFSHDNSMI 

ADFSHDNSMI 

ADFSHDNGII 

ADFSHDSNLV 

ADFSHDNTMT 

ADFSHDNTMT 

ADFSHDNDMM 



GYANELIARL 
GWANELMARL 
GFVNELIARM 
GFVNELIARM 
GFVNELLARL 

349 
400 

AVFSAMGLFR 
AVFSAMGLFR 
AIFSAMGLFN 
AIFSAMGLFN 
PIFAALGLFN 
SIFFALGLYN 
SIFFALGLYN 
SIFFAMGLYN 
SILFALGLYN 
SIFWALGLYN 
S I FAALGL YN 
GIFSAMGLYN 
GVLGALGAYD 



25 



P_involtus_Al 
P_involtus_A2 
T_jpubescens 



30 



35 



F_lycii 
A_fumigatus 
consphyA 
A_nidul ans 
A_f i cirum_NRRL3 135 
A_terreus 
T_thernio 
T_lanuginosa 
M_thermophi la 



350 
401 

QPAPLSTSVP 

QSAPLSTSTP 

QSAPLDPTTP 

QSSPLDPSFP 

ATA . LDPLKP 

GTEFLSRTSV 

GTAPLSTTSV 

GTQPLSMDSV 

GTKPLSTTTV 

GTAPLSQTSV 

GTAKLSTTEI 

GTKPLSTSKI 

GVPPIiDKTAR 



NPWR T WRTSSLVPFS 

DPNR T WLTSSWPFS 

DPAR T FLVKKIVPFS 

NPKR T WVTSRLTPFS 

DENR L WVDSKLVPFS 

ESAKE . . LDG YSASWWPFG 
ESIEE..TDG YSASWTVPFG 
ESIQE..MDG YAASWTVPFG 
ENITQ.-TDG FSSAWTVPFA 
ESVSQ..TDG YAAAWTVPFA 
KSIEE..TDG YSAAWTVPFG 
QPPTGAAADG YAASWTVPFA 
RDPEE . . LGG YAASWAVPFA 



383 

450 

GRMWERLSC 

ARMAVERLSC 

ARMWERLDC 

ARMVTERLLC QRDGTGSGGP 

GHMTVEKLAC 

ARAYFETMQC 

ARAYVEMMQC 

ARAYFELMQC 

SRLYVEMMQC 

ARAYVEMMQC 

GRAYIEMMQC 

ARAYVELLRC ETETSSEEEE 
ARIYVEKMRC SGGGGGGGGG 



40 384 

451 

P_involtus_Al ....... FGT 

P__involtus_A2 AGT 

T_pubescens GGA 

4 5 A_pedi ades SRIMRNGNVQ 

P_lycii SGK 

A_f umiga tus K. .s EKE 

consphyA Q.,A. ..EKE 

A_nidulans E KKE 

50 A_ficuum_NRRL3135 Q. .A. . . EQA 

A_terreus R. .A. . . EKE 

T_thernio D . .b. . . sde 

T_lanuginosa E. .G. . .EDE 

M_thermophila E . . GRQEKDE 

55 

426 



TKVRVLVQDQ 

TKVRVIiVQDQ 

QSVRLLVNDA 

TFVRILVNDA 

EAVRVLVNDA 

PLVRALINDR 

PLVRVLVNDR 

PLVRVLVNDR 

PLVRVLVNDR 

PLVRVLVNDR 

PWRVLVNDR 

PFVRVLVNDR 

EMVRVLVNDR 

439 



VQPLEFCGGD 

VQPLEFCGGD 

VQPLAFCGAD 

LQPLKFCGGD 

VQPLEFCGG. 

WPLHGCDVD 

WPLHGCAVD 

WPLHGCAVD 

WPLHGCPVD 

VMPLHGCPTD 

WPLHGCEVD 

WPLHGCRVD 

VMTLKGCGAD 



RNGLCTLAKF 

QDGLCALDKF 

TSGVCTLDAF 

MDSLCTLEAF 

VDGVCELSAF 

KLGRCKLNDF 

KLGRCKRDDF 

KFGRCTLDDW 

ALGRCTRDSF 

KLGRCKRDAF 

SLGRCKRDDF 

RWGRCRRDEW 

ERGMCTLERF 



425 
500 

VESQTFARSD 

VESQAYARSG 

VESQAYARND 

VESQKYARED 

VESQTYAREN 

VKGLSWARSG 

VEGLS FARSG 

VEGLNFARSG 

VRGLS FARSG 

VAGLS FAQAG 

VRGLS FARQG 

IKGLTFARQG 

IESMAFARGN 
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P_involtus_Al 
P_involtus_A2 
T_pubescens 
5 Ajpediades 

P_lycii 
A_fumigatus 
consphyA 
A_nidulans 
10 A_f icuuin_NRRIi3135 

A_terreus 
T_therrno 
T_lanuginosa 
M_t he rmophi 1 a 

15 



501 514 
GAGDFBKCFA TSA 
GAGDFEKdiA TTV 
GEGDFEKCFA T . . 
GQGDFEKCFD . . . 
6Q6DFAKCGF VPSE 
- . GNWGECFS . . . 
. . GNWAECFA * . . 
. .GNWKTCFT L . . 
. • GDWAECFA . . . 

..GNWADCF. 

. . GNWEGCYA ASE 
. - GHWDRCF - ... 
♦ . GKWDLCFA . . . 
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(2) INFORMATION FOR SEQ ID NO: 25: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1522 base pairs . 

(B) TYPE: nucleic acid 
<C) STRANDS DNESS : single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paxillus involutus 

(B) STRAIN: CBS 100231 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 58. .1383 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 115 . .1383 

(ix) FEATURE:. 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 58. .114 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GGATCCGAAT TCGGCACTCG TACGGTCCCC CGGTCTACCC TCTGCTCGCC TTGGAAG 57 

ATG CTC TTC GGT TTC GTC GCC CTC GCC TGT CTC TTG TCC CTC TCC GAG 105 
Met Leu Phe Gly Phe Val Ala Leu Ala Cys Leu Leu Ser Leu Ser Glu 
-19 -15 -10 -5 

GTC CTT GCG ACC TCC GTG CCC AAG AAC ACA GCG CCG ACC TTC CCC ATT 153 
Val Leu Ala Thr Ser Val Pro Lys Asn Thr Ala Pro Thr Phe Pro He 

15 10 

CCG GAG AGT GAG CAG CGG AAC TGG TCC CCG TAC TCG CCC TAC TTC CCT 201 
Pro Glu Ser Glu Gin. Arg Asn Trp Ser Pro Tyr Ser Pro Tyr Phe Pro 
15 20 25 

CTT GCC GAG TAC AAG GCT CCT CCG GCG GGC TGC CAG ATC AAC CAG GTC 249 
Leu Ala Glu Tyr Lys Ala Pro Pro Ala Gly Cya Gin He Asn Gin Val 
30 35 40 45 

AAC ATC ATC CAA AGA CAT GGT GCC CGG TTC CCG ACC TCT GGC GCG ACC 297 
Asn He He Gin Arg His Gly Ala Arg Phe Pro Thr Ser Gly Ala Thr 

50 55 60 

ACC CGT ATC AAG GCG GGT TTG ACC AAG TTG CAA GGC GTC CAG AAC TTT 345 
Thr Arg He Lys Ala Gly Leu Thr Lys Leu Gin Gly Val Gin Asn Phe 

65 70 * 75 

ACC GAC GCC AAA TTC AAC TTC ATC AAG TCG TTC AAG TAC GAT CTC GGT 393 
Thr Asp Ala Lys Phe Asn Phe He Lys Ser Phe Lys Tyr Asp Leu Gly 
80 85 90 

AAC TCG GAC CTC GTT CCG TTC GGT GCA GCA CAG TCC TTC GAC GCT GGT 441 
Asn Ser Asp Leu Val Pro Phe Gly Ala Ala Gin Ser Phe Asp Ala Gly 
95 100 105 



Fig. 2A 
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CAG GAG GCC TTC GCC CGC TAC TCG AAG CTT GTC AGC AAG AAC AAC CTG 489 
Gin Glu Ala Phe Ala Arg Tyr Ser Lys Leu Val Ser Lys Asn Asn Leu 
110 115 120 125 

CCG TTC ATT CGT GCC GAT GGA AGT GAT CGT GTT GTG GAT TCT GCT ACA 537 
Pro Phe He Arg Ala Asp Gly Ser Asp Arg Val Val Asp Ser Ala Thr 

130 135 140 

AAC TGG ACT GCG GGT TTC GCT TCG GCA AGT CAC AAC ACG GTC CAG CCC 585 
Asn Trp Thr Ala Gly Phe Ala Ser Ala Ser His Asn Thr Val Gin Pro 

145 150 155 

AAG CTG AAC CTG ATT CTC CCG CAA ACT GGC AAT GAT ACC CTG GAA GAT 633 
Lys Leu Asn Leu lie Leu Pro Gin Thr Gly Asn Asp Thr Leu Glu Asp 
160 165 170 

AAT ATG TGC CCT GCT GCT GGC GAT TCT GAC CCC CAG GTC AAC GCG TGG 681 
Asn Met Cys Pro Ala Ala Gly Asp Ser Asp Pro Gin Val Asn Ala Trp 
175 180 185 

TTG GCT GTT GCT TTC CCT TCC ATC ACT GCA CGG CTC AAC GCC GCC GCG 729 
Leu Ala Val Ala Phe Pro Ser He Thr Ala Arg Leu Asn Ala Ala Ala 
190 195 200 205 

CCC TCT GTC AAC CTC ACC GAC ACG GAC GCG TTC AAC CTC GTC AGT CTC 777 
Pro Ser Val Asn Leu Thr Asp Thr Asp Ala Phe Asn Leu Val Ser Leu 

210 215 220 

TGC GCT TTC TTG ACA GTC TCG AAG GAG AAG AAG AGT GAC TTC TGC ACC 825 
Cys Ala Phe Leu Thr Val Ser Lys Glu Lys Lys Ser Asp Phe Cys Thr 

225 230 235 

CTG TTC GAG GGC ATC CCT GGC TCT TTC GAG GCG TTC GCC TAT GGT GGC 873 
Leu Phe Glu Gly lie Pro Gly Ser Phe Glu Ala Phe Ala Tyr Gly Gly 
240 245 250 

GAC CTT GAC AAG TTC TAC GGT ACC GGT TAC GGT CAG GAA CTC GGA CCC 921 
Asp Leu Asp Lys .Phe Tyr Gly Thr Gly Tyr Gly Gin Glu Leu Gly Pro 
255 260 265 

GTT CAA GGC GTC GGC TAC GTC AAC GAG CTC ATC GCC CGC CTC ACC AAC S69 
Val Gin Gly Val Gly Tyr Val Asn Glu Leu He Ala Arg Leu Thr Asn 
270 275 280 285 

TCC GCC GTC CGC GAC AAC ACC CAG ACG AAC CGC ACA CTC GAC GCC TCG 1017 
Ser Ala Val Arg Asp Asn Thr Gin Thr Asn Arg Thr Leu Asp Ala Ser 

290 295 300 

CCC GTA ACC TTC CCG TTG AAC AAG ACG TTC TAC GCC GAT TTC TCC CAC 1065 
Pro Val Thr Phe Pro Leu Asn Lys Thr Phe Tyr Ala Asp Phe Ser His 

305 310 315 

GAC AAC CTC ATG GTC GCC GTC TTC TCC GCC ATG GGC CTC TTC CGC CAG 1113 
Asp Asn Leu Met Val Ala Val Phe Ser Ala Met Gly Leu Phe Arg Gin 
320 .325 330 

CCC GCG CCG CTC AGC ACG TCC GTG CCG AAC CCA TGG CGC ACG TGG CGC 1161 
Pro Ala Pro Leu Ser Thr Ser Val Pro Asn Pro Trp Arg Thr Trp Arg 
335 340 345 
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ACG AGC TCC CTC GTC CCC TTC TCC GGA CGC ATG GTC GTG GAA CGC CTC 1209 
Thr Ser Ser Leu Val Pro Phe Ser Gly Arg Met Val Val Glu Arg Leu 
350 355 360 365 

AGC TGT TTC GGC ACG ACC AAG GTT CGC GTC CTC GTG CA.G GAC CAG GTG 1257 
Ser Cys Phe Gly Thr Thr Lys Val Arg Val Leu Val Gin Asp Gin Val 

370 375 380 

CAG CCG CTC GAG TTC TGC GGG GGT GAT AGG AAC GGG CTG TGC ACG CTT 1305 
Gin Pro Leu Glu Phe Cys Gly Gly Asp Arg Asn Gly Leu Cys Thr Leu 

385 390 395 

GCT AAG TTT GTG GAG AGC CAG ACG TTT GCG AGG AGT GAT GGT GCG GGG 1353 
Ala Lys Phe Val Glu Ser Gin Thr Phe Ala Arg Ser Asp Gly Ala Gly 
400 405 410 

GAC TTT GAG AAG TGC -TTC GCG ACC TCG GCG TGAGGATGGA CGAACAAAAT 1403 
Asp Phe Glu Lys Cys Phe Ala Thr Ser Ala 
415 420 

TAAATTGGGG TATTTTATCG TATAATTATG GTGTGTGTAG AACATGGGCT CGGGGTCGAT 14 63 
GGTGAAAAGC AAAGGTTTAT CGTCTAAAAA AAAAAAAAAA AAAAAATTCC TGCGGCCGC 1522 
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(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paxillus involutus 

(B) STRAIN: CBS 100231 

(ix) FEATURE: 

<A) NAME /KEY: CDS 

(B) LOCATION: 48.. 1373 

(ix) FEATURE : 

(A) NAME /KEY: matjpeptide 

(B) LOCATION: 105. .1373 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 48.. 104 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GGATCCGAAT TCCAGTCCCC AAGCTAATCC TCTGCTCGCC TTGGAAG ATG CAC CTC 56 

Met His Leu 
-19 

GGC TTC GTC ACC CTC GCT TGT CTC ATA CAC CTC TCC GAG GTC TTC GCG 104 
Gly Phe Val Thr Leu Ala Cys Leu lie His Leu Ser Glu Val Phe Ala 
-15 -10 -5 

GCA TCC GTG CCC CGG AAT ATT GCT CCG AAG TTC TCA ATT CCG GAA AGC 152 
Ala Ser Val Pro Arg Asn lie Ala Pro Lys Phe Ser lie Pro Glu Ser 
15 10 15 

GAG CAG CGA AAC TGG TCG CCT TAC TCT CCT TAG TTT CCC CTA GCC GAA 200 
Glu Gin Arg Asn Trp Ser Pro Tyr Ser Pro Tyr Phe Pro Leu Ala Glu 

20 25 30 

TAC AAG GCT CCT CCA GCA GGC TGC GAG ATT AAC CAA GTC AAT ATT ATC 248 
Tyr Lys Ala Pro Pro Ala Gly Cys Glu He Asn Gin Val Asn He He 
35 40 45 

CAA CGG CAT GGC GCA CGG TTC CCA ACC TCG GGT GCG GCC ACT CGC ATC 296 
Gin Arg His Gly Ala Arg Phe Pro Thr Ser Gly Ala Ala Thr Arg He 
50 55 go 

AAG GCT GGT TTA AGC AAG CTG CAA TCC GTC CAG AAT TTC ACC GAC CCC 344 
Lys Ala Gly Leu Ser Lys Leu Gin Ser Val Gin Asn Phe Thr Asp Pro 
65 70 75 80 

AAA TTC GAC TTC ATC AAG TCG TTC ACA TAC GAT CTT GGT ACT TCC GAC 392 
Lys Phe Asp Phe He Lys Ser Phe Thr Tyr Asp Leu Gly Thr Ser Asp 

85 90 95 



Fig. 3A 
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CTC GTG CCA TTC GGC GCA GCA CAA TCA TTC GAT GCC GGC CTG GAG GTC 440 
Leu Val Pro Phe Gly Ala Ala Gin Ser Phe Asp Ala Gly Leu Glu Val 

100 105 HO 

TTC GCT CGC TAT TCG AAG CTC GTC AGC TCG GAC AAC CTG CCT TTC ATT 488 
Phe Ala Arg Tyr Ser Lys Leu Val Ser Ser Asp Asn Leu Pro Phe lie 
115 120 125 

CGC TCA GAT GGT AGC GAT CGT GTA GTC GAC ACT GCT ACG AAC TGG ACT 536 
Arg Ser Asp Gly Ser Asp Arg Val Val Asp Thr Ala Thr Asn Trp Thr 
130 135 140 

GCA GGT TTT GCT TCC GCG AGC CGC AAC GCG ATC CAA CCC AAG CTC GAC 584 
Ala Gly Phe Ala Ser Ala Ser Arg Asn Ala lie Gin Pro Lys Leu Asp 
145 150 155 1«0 

TTG ATA CTT CCA CAA ACT GGC AAT GAC ACC CTC GAG GAC AAC ATG TGT 632 
Leu lie Leu Pro Gin Thr Gly Asn Asp Thr Leu Glu Asp Asn Met Cys 

165 170 175 



CCA GCT GCT GGC GAA TCC GAC CCT CAG GTC GAT GCG TGG TTG GCG TCC 
Pro Ala Ala Gly Glu Ser Asp Pro Gin Val Asp Ala Trp Leu Ala Ser 

180 185 190 



AAG TTC TAT GGG ACC GGC TAT GGC CAA GCC CTC GGA CCG GTC CAA GGC 
Lys Phe Tyr Gly Thr Gly Tyr Gly Gin Ala Leu Gly Pro Val Gin Gly 

260 265 270 



Fig. 3B 
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GCC TTC CCA TCT GTC ACC GCG CAG CTC AAC GCT GCA GCG CCT GGT GCC 728 
Ala Phe Pro Ser Val Thr Ala Gin Leu Asn Ala Ala Ala Pro Gly Ala 
195 200 205 

AAT CTC ACA GAC GCC GAC GCC TTC AAC CTC GTC AGC CTG TGT CCC TTC 776 
Asn Leu Thr Asp Ala Asp Ala Phe Asn Leu Val Ser Leu Cys Pro Phe 
210 215 220 

ATG ACA GTT TCG AAG GAG CAG AAG AGC GAC TTC TGC ACG TTG TTC GAG 824 
Met Thr Val Ser Lys Glu Gin Lys Ser Asp Phe Cys Thr Leu Phe Glu 
225 230 235 240 

GGA ATC CCT GGA TCG TTC GAG GCG TTT GCC TAT GCC GGC GAC CTT GAC 872 
Gly He Pro Gly Ser Phe Glu Ala Phe Ala Tyr Ala Gly Asp Leu Asp 

245 250 255 



920 



GTC GGC TAC ATC AAC GAG CTC CTT GCA CGC CTG ACC AAC TCC GCA GTG 968 
Val Gly Tyr He Asn Glu Leu Leu Ala Arg Leu Thr Asn Ser Ala Val 
275 280 285 

AAC GAC AAC ACA CAG ACG AAC CGC ACA CTC GAC GCC GCA CCA GAC ACG 1016 
Asn Asp Asn Thr Gin Thr Asn Arg Thr Leu Asp Ala Ala Pro Asp Thr 
290 295 300 

TTC CCG CTC AAC AAG ACC ATG TAC GCC GAT TTC TCA CAC GAC AAC CTC 1064 
Phe Pro Leu Asn Lys Thr Met Tyr Ala Asp Phe Ser His Asp Asn Leu 
305 310 315 320 

ATG GTC GCC GTG TTC TCC GCC ATG GGC CTC TTC CGC CAA TCC GCA CCG 1112 
Met Val Ala Val Phe Ser Ala Met Gly Leu Phe Arg Gin Ser Ala Pro 

325 330 335 
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CTC AGC ACG TCC ACA CCG GAT CCG AAC CGC ACG TGG CTC ACG AGC TCT 1160 
Leu Ser Thr Ser Thr Pro Asp Pro Asn Arg Thr Trp Leu Thr Ser Ser 

340 345 350 



GTC GTT CCG TTC TCC GCG CGC ATG GCC GTG GAA CGC CTC AGC TGT GCT 1208 
Val Val Pro Phe Ser Ala Arg Met Ala Val Glu Arg Leu Ser Cys Ala 
355 360 365 

GGT ACC ACG AAG GTG CGC GTC CTG GTG CAG GAC CAG GTC CAG CCA CTC 1256 
Gly Thr Thr Lys Val Arg Val Leu Val Gin Asp Gin Val Gin Pro Leu 
. 370 375 380 



GAG TTC TGC GGC GGC GAC CAG GAT GGG TTG TGC GCG CTA GAC AAG TTC 1304 
Glu Phe Cys Gly Gly Asp Gin Asp Gly Leu Cys Ala Leu Asp Lys Phe 
395 390 395 400 

GTC GAG AGC CAG GCG TAT GCA CGG AGT GGT GGC GCA GGT GAC TTT GAG 1352 
Val Glu Ser Gin Ala Tyr Ala Arg Ser Gly Gly Ala Gly Asp Phe Glu 

405 410 415 

AAG TGT CTT GCG ACG ACG GTG TGAGATGGGG TAATCTACGG TGAAGCAGCG 1403 
Lys Cys Leu Ala Thr Thr Val 

420 

GAGAGCCTCT CAACGAATGC AAAGGATAGG TTCGAGGCTT ACTTCATCAA CCTATATCAT 1463 



CATAGGACAA GCCCCCCAAT AGCCAGACTC GTCGTTTGAC ATCGTGTATG AAAATAACCC 1523 
ACCCACGCAC TCCGCTGCCA CTATTCGCGT GTATCGCATA CTAGGCGTTT TCGCCCAGTT 1583 
GAACATGAGC CCATTCTGTC CCCAGTGAAA AAAAAAAAAA AAAAAATTCC TGCGGCCGC 1642 



Fig. 3C 
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{2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trametes pubescens 

(B) STRAIN: CBS 100232 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:79. .1407 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 130. • 1407 

(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: 79. .129 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GGATCCGAAT TCGCCCCCAC ATTCGTTCCA TCTTAGCAGC CGTCCGCGCC CAGGTCTTCG 60 

ATAACCCCCC GCGTGACT ATG GCC TTC TCA ATC TTG GCC TCG CTG CTC TTC 111 

Met Ala Phe Ser lie Leu Ala Ser Leu Leu Phe 
-17 -15 -10 

GTG TGT TAT GCA TAC GCC AGG GCT GTG CCC CGT GCA CAT ATC CCG CTC 159 
Val Cys Tyr Ala Tyr Ala Arg Ala Val Pro Arg Ala His He Pro Leu 
-5 1 5 10 

CGC GAC ACC TCC GCG TGT CTA GAT GTA ACA CGC GAT GTG CAG CAG AGC 207 
Arg Asp Thr Ser Ala Cys Leu Asp Val Thr Arg Asp Val Glri Gin Ser 

15 20 25 

TGG TCC ATG TAC TCT CCC TAT TTC CCG GCA GCA ACT TAT GTG GCT CCG 255 
Trp Ser Met Tyr Ser Pro Tyr Phe Pro Ala Ala Thr Tyr Val Ala Pro 

30 35 40 

CCC GCG AGT TGC CAG ATC AAT CAG GTC CAC ATC ATC CAA CGT CAT GGT 303 
Pro Ala Ser Cys Gin He Asn Gin Val His He lie Gin Arg His Gly 
45 50 55 

GCA CGC TTT CCC ACG TCT GGC GCA GCA AAG CGC ATC CAG ACA GCA GTA 351 
Ala Arg Phe Pro Thr Ser Gly Ala Ala Lys Arg He Gin Thr Ala Val 
60 65 70 

GCG AAG CTG AAG GCC GCG TCC AAC TAC ACC GAT CCC CTG CTC GCG TTC 399 
Ala Lys Leu Lys Ala Ala Ser Asn Tyr Thr Asp Pro Leu Leu Ala Phe 
75 ~ 80 85 30 

GTT ACG AAC TAC ACC TAC AGC TTA GGT CAG GAC AGC CTC GTT GAA CTC 447 
Val Thr Asn Tyr Thr Tyr Ser Leu Gly Gin Asp Ser Leu Val Glu Leu 

95 100 105 



Fig. 4A 
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GGT GCG ACT CAG TCC TCC GAA GCG GGC CAG GAG GCA TTC ACG CGG TAC 495 
Gly Ala Thr Gin Ser Ser Glu Ala Gly Gin Glu Ala Phe Thr Arg Tyr 

110 115 120 

TCA TCC CTC GTG AGC GCG GAC GAG CTT CCC TTC GTT CGG GCG TCG GGC 543 
Ser Ser Leu Val Ser Ala Asp Glu Leu Pro Phe Val Arg Ala Ser Gly 
125 130 135 

TCA GAT CGC GTC GTT GCG ACT GCC AAC AAC TGG ACT GCA GGT TTC GCG 591 
Ser Asp Arg Val Val Ala Thr Ala Asn Asn Trp Thr Ala Gly Phe Ala 
140 145 150 

CTT GCG AGC TCA AAC AGC ATC ACG CCC GTG CTC TCA GTC ATC ATT TCC 639 
Leu Ala Ser Ser Asn Ser lie Thr Pro Val Leu Ser Val lie He Ser 
155 160 165 170 

GAA GCG GGC AAT GAC ACC CTC GAC GAC AAC ATG TGC CCC GCT GCA GGC 687 
Glu Ala Gly Asn Asp Thr Leu Asp Asp Asn Met Cya Pro Ala Ala Gly 

175 180 185 

GAT TCG GAT CCC CAG GTC AAT CAA TGG CTC GCG CAG TTC GCA CCG CCG 735 
Asp Ser Asp Pro Gin Val Asn Gin Trp Leu Ala Gin Phe Ala Pro Pro 

190 195 200 

ATG ACT GCT CGC CTC AAC GCA GGC GCG CCC GGC GCG AAC CTC ACG GAC 783 
Met Thr Ala Arg Leu Asn Ala Gly Ala Pro Gly Ala Asn Leu Thr Asp 
205 " 210 215 

ACG GAC ACC TAC AAC CTG CTC ACG CTA TGC CCG TTC GAG ACT GTA GCC 831 
Thr Asp Thr Tyr Asn Leu Leu Thr Leu Cys Pro Phe Glu Thr Val Ala 
220 225 230 

ACC GAG CGG CGT AGT GAA TTC TGC GAC ATC TAC GAG GAG CTG CAG GCG 879 
Thr Glu Arg Arg Ser Glu Phe Cys Asp He Tyr Glu Glu Leu Gin Ala 
235 240 245 250 

GAA GAC GCC TTC GCG TAC AAT GCC GAT CTC GAC AAG TTC TAC GGC ACT 927 
Glu Asp Ala Phe Ala Tyr Asn Ala Asp Leu Asp Lys Phe Tyr Gly Thr 

255 260 265 

GGA TAC GGC CAG CCC CTC GGA CCC GTG CAA GGC GTC GGG TAC ATC AAC 975 
Gly Tyr Gly Gin Pro Leu Gly Pro Val Gin Gly Val Gly Tyr He Asn 

270 275 280 

GAG CTC ATC GCG CGC CTC ACC GCG CAG AAC GTG TCC GAC CAC ACG CAG 1023 
Glu Leu He Ala Arg Leu Thr Ala Gin Asn Val Ser Asp His Thr Gin 
285 290 295 

ACG AAC AGC ACA CTC GAC TCC TCG CCC GAG ACG TTC CCG CTC AAC CGC 1071 
Thr Aan Ser Thr Leu Asp Ser Ser Pro Glu Thr Phe Pro Leu Asn Arg 
300 305 310 

ACG CTC TAC GCG GAC TTC TCG CAC GAC AAC CAG ATG GTC GCG ATC TTC 1119 
Thr Leu Tyr Ala Asp Phe Ser Hia Asp Asn Gin Met Val Ala He Phe 
315 320 325 330 

TCG GCC ATG GGT CTC TTC AAC CAG TCC GCG CCG CTC GAC CCG ACG ACG 1167 
Ser Ala Met Gly Leu Phe Asn Gin Ser Ala Pro Leu Asp Pro Thr Thr 

335 340 345 



Fig. 4B 
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CCC GAC CCC GCG CGC ACG TTC CTC GTC AAG AAG ATC GTG CCG TTC TCC 1215 
Pro Asp Pro Ala Arg Thr Phe Leu Val Lys Lys lie Val Pro Phe Ser 

3S0 * 355 360 

GCG CGC ATG GTC GTC GAG CGC CTC GAC TGC GGC GGT GCG CAG AGC GTG 1263 
Ala Arg Met Val Val Glu Arg Leu Asp Cys Gly Gly Ala Gin Ser Val 
365 370 375 

CGC CTG CTC GTG AAC GAC GCA GTG CAG CCG CTG GCG TTC TGC GGG GCG 1311 
Arg Leu Leu Val Asn Asp Ala VaX Gin Pro Leu Ala Phe Cys Gly Ala 
380 385 390 

GAC ACG AGC GGG GTG TGC ACG CTG GAC GCG TTT GTC GAG AGC CAG GCG 1359 
Asp Thr Ser Gly Val Cys Thr Leu Asp Ala Phe Val Glu Ser Gin Ala 
395 400 405 410 

TAC GCG CGG AAC GAT GGC GAG GGC GAC TTC GAG AAG TGC TTC GCG ACA 1407 
Tyr Ala Arg Asn Asp Gly Glu Gly Asp Phe Glu Lys Cys Phe Ala Thr 

415 420 425 

TAGTTCCAGG TGTAGATACC CGGGGAAGAT GTACTCTCTA GACACCTCGC ATGTACTTAT 14 67 

CGATTAGAAA GAGACCCTGG CTGCTCTGCC CTCAAAAAAA AAAAAAAAAA AAAAAATTCC 1527 

TGCGGCCGC 1536 



Fig. 4C 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1501 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

<il) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Agrocybe pediades 

(B) STRAIN: CBS 900.96 

<ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 17 • .1375 

<ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 17. .94 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 95. .1375 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGATCCGAAT TCACTT ATG TCC CTC TTC ATC GGC GGC TGT TTG CTC GTG 4 9 

Met Ser Leu Phe He Gly Gly Cys Leu Leu Val 
-26 -25 -20 

TTT TTA CAG GCG AGC GCA TAC GGC GGC GTC GTG CAG GCC ACA TTC GTG 97 
Phe Leu Gin Ala Ser Ala Tyr Gly Gly Val Val Gin Ala Thr Phe Val 
-15 -10 -5 1 

CAG CCG TTT TTC CCT CCA CAG ATT CAG GAC TCT TGG GCA GCT TAT ACA 145 
Gin Pro Phe Phe Pro Pro Gin He Gin Asp Ser Trp Ala Ala Tyr Thr 

5 10 15 

CCA TAT TAT CCT GTT CAG GCG TAC ACG CCT CCC CCG AAG GAT TGC AAG 193 
Pro Tyr Tyr Pro Val Gin Ala Tyr Thr Pro Pro Pro Lys Asp Cys Lys 
20 25 30 

ATC ACA CAA GTT AAC ATT ATT CAA CGA CAT GGT GCC CGC TTT CCG ACA 241 
He Thr Gin Val Asn He He Gin Arg His Gly Ala Arg Phe Pro Thr 
35 40 45 

TCG GGG GCA GGC ACA AGG ATC CAA GCA GCT GTG AAG AAG CTT CAA TCA 289 
Ser Gly Ala Gly Thr Arg He Gin Ala Ala Val Lys Lya Leu Gin Ser 
50 55 .60 65 

GCT AAA ACC TAT ACG GAT CCT CGT CTC GAC TTT CTG ACC AAC TAT ACC 337 
Ala Lys Thr Tyr Thr Asp Pro Arg Leu Asp Phe Leu Thr Asn Tyr Thr 

70 75 80 

TAT ACC CTT GGT CAC GAC GAT CTC GTA CCG TTT GGA GCG CTT CAA TCA 385 
Tyr Thr Leu Gly His Asp Asp Leu Val Pro Phe Gly Ala Leu Gin Ser 

85 90 95 



Fig. 5A 
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TCA CAA GCT GGA GAG GAA ACG TTT CAA CGA TAC TCG TTT CTG GTG TCC 
Ser Gin Ala Gly Glu Glu Thr Phe Gin Arg Tyr Ser Phe Leu Val Ser 
100 * 105 HO 

AAA GAG AAC TTA CCT TTT GTA AGA GCT TCG AGT TCfc AAT CGA GTC GTC 
Lys Glu Asn Leu Pro Phe Val Arg Ala Ser Ser Ser Asn Arg Val Val 
115 120 125 

GAC TCA GCT ACC AAC TGG ACG GAA GGT TTT TCT GCG GCC AGT CAC CAC 
Asp Ser Ala Thr Asn Trp Thr Glu Gly Phe Ser Ala Ala Ser His His 
130 135 140 145 

GTC TTG AAT CCC ATT CTC TTT GTA ATC CTC TCA GAA AGT CTC AAT GAC 
Val Leu Asn Pro lie Leu Phe Val He Leu Ser Glu Ser Leu Asn Asp 

150 155 160 

ACG CTT GAC GAT GCC ATG TGC CCT AAC GCG GGC TCC TCC GAC CCG CAG 
Thr Leu Asp Asd Ala Met Cys Pro Asn Ala Gly Ser Ser Asp Pro Gin 

165 " 170 175 

ACT GGT ATC TGG ACC TCG ATA TAC GGG ACG CCT • ATT GCC AAC CGA CTA 
Thr Gly He Trp Thr Ser He Tyr Gly Thr Pro He Ala Asn Arg. Leu 
180 185 190 

AAT CAG CAG GCT CCG GGT GCA AAT ATT ACA GCT GCC GAT GTG TCG AAC 
Asn Gin Gin Ala Pro Gly Ala Asn He Thr Ala Ala Asp Val Ser Asn 
195 200 205 

CTT ATA CCG CTT TGC GCA TTC GAG ACG ATA GTA AAG GAG ACG CCA AGT 
Leu He Pro Leu Cys Ala Phe Glu Thr He Val Lys Glu Thr Pro Ser 
210 215 220 225 

CCT TTC TGT AAT TTG TTC ACC CCC GAA GAG TTC GCA CAG TTT GAA TAT 
Pro Phe Cys Asn Leu Phe Thr Pro Glu Glu Phe Ala Gin Phe Glu Tyr 

230 235 240 

TTC GGT GAC CTG GAC AAG TTC TAT GGG ACA GGT TAT GGA CAA CCG TTA 
Phe Gly Asp Leu Asp Lys Phe Tyr Gly Thr Gly Tyr Gly Gin Pro Leu 

245 250 255 

GGA CCT GTG CAA GGT GTC GGC TAC ATC AAT GAA CTT CTT GCC CGA CTC 
Gly Pro Val Gin Gly Val Gly Tyr He Asn Glu Leu Leu Ala Arg Leu 
260 " 265 270 

ACA GAA ATG CCA GTT CGA GAT AAC ACC CAG ACG AAC AGG ACA CTC GAC 
Thr Glu Met Pro Val Arg Asp Asn Thr Gin Thr Asn Arg Thr Leu Asp 
275 280 28S 

TCT TCT CCG CTT ACA TTT CCC CTC GAC CGC AGT ATC TAC GCT GAC CTC 
Ser Ser Pro Leu Thr Phe Pro Leu Asp Arg Ser He Tyr Ala Asp Leu 
290 295 300 305 

TCG CAC GAT AAC CAA ATG ATC GCG ATA TTT TCA GCG ATG GGT CTT TTC 
Ser His Asp Asn Gin Met He Ala He Phe Ser Ala Met Gly Leu Phe 

310 315 320 

AAC CAG AGT TCA CCT TTG GAT CCG TCC TTC CCC AAC CCC AAG CGT ACT 
Asn Gin Ser Ser Pro Leu Asp Pro Ser Phe Pro Asn Pro Lys Arg Thr 

325 330 335 
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TGG GTC ACC AGT CGG CTT ACG CCT TTC AGC GCG AGA ATG GTC ACT GAG 
Trp Val Thr Ser Arg Leu Thr Pro Phe Ser Ala Arg Met Val Thr Glu 
340 345 350 

CGG TTG CTG TGT CAA AGG GAT GGG ACA GGG AGC GGT GGA CCA TCC AGG 
Arg Leu Leu Cys Gin Arg Asp Gly Thr Gly Ser Gly Gly Pro Ser Arg 
355 360 365 

ATC ATG CGG AAT GGA AAT GTG CAG ACG TTT GTG AGG ATT CTT GTC AAC 
lie Met Arg Asn Gly Asn Val Gin Thr Phe Val Arg lie Leu Val Asn 
370 375 380 385 

GAT GCT TTA CAG CCT TTG AAG TTC TGC GGA GGG GAC ATG GAT AGT TTG 
Asp Ala Leu Gin Pro Leu Lys Phe Cys Gly Gly Asp Met Asp Ser Leu 

390 395 400 

TGT ACT CTG GAA GCG TTC GTC GAG AGC CAG AAG TAT GCA CGA GAG GAT 
Cys Thr Leu Glu Ala Phe Val Glu Ser Gin Lys Tyr Ala Arg Glu Asp 

405 410 415 

GGT CAA GGC GAT TTT GAA AAA TGT TTT GAT TAAATATTGC AGTATGCTCA 
Gly Gin Gly Asp Phe Glu Lys Cys Phe Asp 
420 425 



1153 



1201 



1249 



1297 



1345 



1395 



GTGAGTAGAC TACAGTGCAG GCCCTGTAAC TCTTGTATTG TGTTTCTGGA ATTCCTCGGA 1455 
GCGTAGTTTG TAGCAAAAAA AAAAAAAAAA AAATTCCTGC GGCCGC 1501 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peniophora lycii 

(B) STRAIN: CBS 68 6.96 

(ix) FEATURE: 

(A) NAME /KEY: sigjpeptide 

(B) LOCATION : 123 . • 212 

(ix) FEATURE: 

(A) NAME/KEY:. xnat_j>eptide 

(B) LOCATION: 213. .1439 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION:123. .1439 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGATCCGAAT TCCATCTTCT GCTCTGACCT CCATCTCGCT GAGCGGCCGA CGAGAACCTA 60 

GGGGCTCTAA GTCCACGTAC TATCGCCGCG CCTGTGAAGG CCCCATACCA GCCCTTATCG 120 

AT ATG GTT TCT TCG GCA TTC GCA CCT TCC ATC CTA CTT AGC TTG ATG 167 
Met Val Ser Ser Ala Phe Ala Pro Ser lie Leu Leu Ser Leu Met 
-30 -25 -20 

TCG AGT CTT GCT TTG AGC ACG CAG TTC AGC TTT GTT GCG GCG CAG CTA 215 
Ser Ser Leu Ala Leu Ser Thr Gin Phe Ser Phe Val Ala Ala Gin Leu 
-15 -10 -5 1 

CCT ATC CCC GCA CAA AAC ACA AGT AAT TGG GGG CCT TAC GAT CCC TTC 263 
Pro He Pro Ala Gin Asn Thr Ser Asn Trp Gly Pro Tyr Asp Pro Phe 

5 10 15 

■TTT CCC GTC GAA CCG TAT GCA GCT CCG CCG GAA GGG TGC ACA GTG ACA 311 
Phe Pro Val Glu Pro Tyr Ala Ala Pro Pro Glu Gly Cys Thr Val Thr 
20 * 25 30 

CAG GTC AAC CTG ATT CAG AGG CAC GGC GCG CGT TGG CCC ACA TCC GGC 359 
Gin Val Asn Leu He Gin Arg His Gly Ala Arg Trp Pro Thr Ser Gly 
35 40 45 

GCG CGG TCG CGG CAG GTC GCC GCC GTA GCG AAG ATA CAA ATG GCG CGA 407 
Ala Arg Ser Arg Gin Val Ala Ala Val Ala Lys He Gin Met Ala Arg 
50 55 60 65 

CCA TTC ACG GAT CCC AAG TAT GAG TTC CTC AAC GAC TTC GTG TAC AAG 455 
Pro Phe Thr Aso Pro Lys Tyr Glu Phe Leu Asn Asp Phe Val Tyr Lys 

70 75 80 



Fig. 6A 
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TTC GGC GTC GCC GAT CTG CTA CCG TTC GGG GCT AAC CAA TCG CAC CAA 
Phe Gly Val Ala Aso Leu Leu Pro Phe Gly Ala Asn Gin Ser His Gin 

85 90 95 

ACC GGC ACC GAT ATG TAT ACG CGC TAC AGT ACA CTA TTT GAG GGC • GGG 
Thr Gly Thx Asp Met Tyr Thr Arg Tyr Ser Thr Leu Phe Glu Gly Gly 
100 * 105 HO 

GAT GTA CCC TTT GTG CGC GCG GCT GGT GAC CAA CGC GTC GTT GAC TCC 
Asp Val Pro Phe Val Arg Ala Ala Gly Asp Gin Arg Val Val Asp Ser 
115 120 125 

TCG ACG AAC TGG ACG GCA GGC TTT GGC GAT GCT TCT GGC GAG ACT GTT 
Ser Thr Asn Trp Thr Ala Gly Phe Gly Asp Ala. Ser Gly Glu Thr Val 
130 135 140 I 45 

CTC CCG ACG CTC CAG GTT GTG CTT CAA GAA GAG GGG AAC TGC ACG CTC 
Leu Pro Thr Leu Gin Val Val Leu Gin Glu Glu Gly Asn Cys Thr Leu 

150 155 160 

TGC AAT AAT ATG TGC CCG AAT GAA GTG GAT GGT GAC GAA TCC ACA ACG 
Cys Asn Asn Met Cys Pro Asn Glu Val Asp Gly Asp Glu Ser Thr Thr 

165 * 170 175 

+ 

TGG CTG GGG GTC TTT GCG CCG AAC ATC ACC GCG CGA TTG AAC GCT GCT 
Trp Leu Gly Val Phe Ala Pro Asn lie Thr Ala Arg Leu Asn Ala Ala 
180 185 190 

GCG CCG AGT GCC AAC CTC TCA GAC AGC GAC GCG CTC ACT CTC ATG GAT 
Ala Pro Ser Ala Asn Leu Ser Asp Ser Asp Ala Leu Thr Leu Met Asp 
195 200 205 

ATG TGC CCG TTC GAC ACT CTC AGC TCC GGG AAC GCC AGC CCC TTC TGT 
Cya Pro Phe Asp Thr Leu Ser Ser Gly Asn Ala Ser Pro Phe Cys 

220 225 



Met 

210 215 



GAC CTA TTT ACC GCG GAG GAG TAT GTG TCG TAC GAG TAC TAC TAT GAC 
Asp Leu Phe Thr Ala Glu Glu Tyr Val Ser Tyr Glu Tyr Tyr Tyr Asp 

230 235 240 

CTC GAC AAG TAC TAT GGC ACG GGC CCC GGG AAC GCT CTC GGT CCT GTC 
Leu Asp Lys Tyr Tyr Gly Thr Gly Pro Gly Asn Ala Leu Gly Pro Val 

245 " ' 250 255 

CAG GGC GTC GGA TAC GTC AAT GAG CTG CTT GCA CGC TTG ACC GGC CAA 
Gin Gly Val Gly Tyr Val Asn Glu Leu Leu Ala Arg Leu Thr Gly Gin 
260 265 270 

GCC GTT CGA GAC GAG ACG CAG ACG AAC CGC ACG CTC GAC AGC GAC CCT 
Ala Val Arg Asp Glu Thr Gin Thr Asn Arg Thr Leu Asp Ser Asp Pro 
275 280 285 

GCA ACA TTC CCG CTG AAC CGT ACG TTC TAC GCC GAC TTC TCG CAT GAT 
Ala Thr Phe Pro Leu Asn Arg Thr Phe Tyr Ala Asp Phe Ser His Asp 
290 295 300 305 

AAC ACC ATG GTG CCC ATC TTT GCG GCG CTC GGG CTC TTC AAC GCC ACC 
Asn Thr Met Val Pro lie Phe Ala Ala Leu Gly Leu Phe Asn Ala Thr 

310 315 320 
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GCC CTC GAC CCG CTG AAG CCC GAC GAG AAC AGG TTG TGG GTG GAC TCT 1223 
Ala Leu Asp Pro Leu Lys Pro Asp Glu Asn Arg Leu Trp Val Asp Ser 

323 330 " 335 

AAG CTG GTA CCG TTC TCT GGA CAT ATG ACG GTC GAG AAG CTG GCA TGT 1271 
Lys Leu Val Pro Phe Ser Gly His Met Thr Val Glu Lys Leu Ala Cys 
340 345 350 

TCT GGG AAG GAG GCG GTC AGG GTG CTC GTG AAC GAC GCG GTG CAG CCG 1319 
Ser Gly Lys Glu Ala Val Arg Val Leu Val Asn Asd Ala Val Gin Pro 
355 360 365 

CTG GAG TTC TGC GGA GGT GTT GAT GGG GTG TGC GAG CTT TCG GCT TTC 1367 
Leu Glu Phe Cys Gly Gly Val Asp Gly Val Cys Glu Leu Ser Ala Phe 
370 375 380 385 

GTA GAG AGC CAG ACG TAT GCG CGG feAG AAT GGG CAA GGC GAC TTC GCC 1415 
Val Glu Ser Gin Thr Tyr Ala Arg Glu Asn Gly Gin Gly Asp Phe Ala 

390 395 400 

AAG TGC GGC TTT GTT CCG TCG GAA TAGCGGGAGA CCGTCTATGC TACACAGTAA 14 69 
Lys Cys Gly Phe Val Pro Ser Glu 

405 

TTGTGTACTC TATAGCACTG TAGCTGTACT TACAAGTCGT AGGGTACGAT CGTACTTACG 1529 

CTCGTTTATT GATCCTTCCT TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA ATTCCTGCGG 1589 

CCGC 1593 



Fig. 6C 
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g^cgacgaggcacaccaegcccg^cctcggcgggcccgagagggccgggetcggg-trega SO 
caa ggag acgggcgrccctscgggcgcggcrgcgggsgcgggrgctgccgcggacggtga 120 
ggagggggacgggc^gggcgcrgatgacggracgaatgcgaacggacacaggccgecgag 18Q 
c^^^^sxgttgcgttctaaccrttctLCcgcgxgggcgxgtacgtigtgggtgrgtacgrg^ 2 40 
ttgggggggggaargttcrtggtaat,tafcctttctaccectct£ctcs.rtccrc*attcr 300 
?ttcagcaggt:ataccccg£gcaagxgzacaggat:catgggacgggrgggcggac.ggacr 3 60 
* cct ctagaaggacggataaggaaaaaggggaaacacgaatatggcgcectgggtggcgc 420 
gccgagctggatgcctgacgccggtctggcaaacatrttccccccccagcacccaaccta 480 
^rt^cttgatagagrgtttcggggccaggcggcctgcgctigtgrrrtraccaatcaccaac 540 
tagtgctactactactattgeggc^gcegatgcagceg-egraccaaaaatgcegeggcat (00 
ctceattg^tacttgtagttncgatagatcaaeacttgggaggttgcgctgggctgctct 660 
gaaaccccfcctetcttgctgtiacgtaacgtatgtgcacagtargrcaccgacaaagacga "720 
ttgcatgcgcatcgttttttgttgtgxt:teaggccccgct:cgrgeceagggtataaaeae 780 
attgaagacracatatgcgcaagacgrcgacaccaacggggcccrgcagcegccgcaggc 8 40 
gcatgtcgtgartaataccacgcgcccgcgcaaaccagctagccgccgccctgrtttcacn 900 
cggttagagacggacaggtgagacgggtcfccggttaagcaagcaaactggaatgcaaggc 9 60 
tgaaggcgtaatctgcatagcgtggaaaxgagagggc^ccgcgggcagccaggaaggrga 10 20 
S^cgaaacgaggaaagaggcaccagaagctgncgccccga age gcccgcggrca cage tc 1080 
caggatcaagtacggatgtcccatgccaagctgctggcttcgaaagcgagtacggagtag 1140 
tgtccatfcgttcacgagggacccccaatg^gttagacatgcetgaatcaactfctgxccta 1200 
tttttggatttcaactgttrcrctcgacrgrgctcggxagcgaccatgccgcaaggcaca 12 60 
cfcaeatgttgtacaataaccafcacarcgaccfcfcccgtaggagrgctgaaatacccgacct 1320 
gctctctctagcaggxgcctaatggctttcgtgtaacrcgatcgaaacggarcagcaagt 1380 
ccatttgctgttggregagacgtacgatttacaaacacgtggagaggrgagccacagcga 1440 
taggcttctggaaggattcrggcg^cricggaaagagggccacrcgccecactaaceggcg 1500 
cegafccctgacatsgggctcgcagggggcttaagtgcacacfcacggagracggattacac 1560 
agtagegtatgggtgggggcgagtttitgggtggccttgrgtggggcfccaccggctgcctgt 1620 
tctciggggagtcttggc sytjc cgattggacccaccraaccacggg^ag^ct^ggeccggc 1 i 8 0 
canutcacaccgccczcatgcttcggagccagtcaggtyaggc^iggcaccacrcagrcagg 1740 
tacacacgtcgggc-icctcgatgctgggxgacatcgaggcgatactgcattccaactacg 1800 
gtcggcataggaggcatcceattctagagctgitctacgccggaacgxaacccgggataa 1860 
ceegggafcafccgctrccctgagcgagcgcgctgctgaggateatacaacccaacaaccga 1920 
egaeggtgeaaga agger gggggaaggaagaaatcaaggaaaaaaaaatagggggggrgg 1980 
ggacca agagaga aa gaa a ggagaa a a gggtgggggg a ggga agaga aa a aa aa aa egga 2040 
ggaatatggcgtcgctcttcgaccggrtccggaagggggcatctgggtacacata^gcac 2100 
ctctrtccgcacggcagggaeaeaaacc ggga gtgcagtcccaccgaticacgctgagcccg 2160 
cccgrtctccagacrccacggtcgcagaggaccagacgcgcggtgaagatgactggcctcg 2220 

M T G L G 5 

gagtgatggtggrgatggrcggcttcctggcgatcgcctctctrg-taagcagcgatcccag 2280 
VMVVHVGFX,AXA5X. 19 

OTS"fcccggtgtgcg£taaaagaaaaagcraacgccaccagacaatccgagtcccggecafc 2340 

Q S E £ R ? C 26 

gcgacaccccagacrtgggcttccagrigcgg-cacggccatittcccacttctggggccagt 2400 
DTPDIGFQCGTAISHFWGQr 46 

acccgccctacttctccgtgcccccggagccggatgcttcgatccccgacgact:gcgagg 24 60 
SPTFS- VPSELDASIPDDCEV 66 

tgacgcttgcccaagtcctcrcccgccacggcgcgagggcgccgacgctcaaacgggccg 2520 
TFAQViSRHGARAFTLKRAA 86 

« 

cgagctacgtegacercatcgacaggatccaccatggcgccatcfccccacgggccgggc* 2580 
S TVDLID R Z H H G A I S Y G P G T 106 

acgagtccctcaggacgtacgaccacaccctgggcgccgacgagcccacecggacgggcc 2640 
EFLRtrXOYTLGADELT R T G Q 126 

agcagcagatggccaactcgggcaccaagrtttaccgccgccaccgcgctctcgcccgca 2700 
QQMVHSGI KrTRRrRALARK14 6 
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agtcgatccccttcgtccgcaccgccggccaggaccgcgtcgtccactcggccgagaacc. 27 60 
S IP F V R T A G Q D RVVH S A E H F 166 

tcacccagggcttccactczgccctgctcgccgaccgcgggtccaccgtccggcccaccq 2820 
T QG F H S A I* LA D R G • S T V R ? T L 18 6 

tcccctatgacatggtcgtcafccccggaaaccgeeggcgccaacaacacgctccaeaacg 2880 
P rDMVVIP ETAGAKNTLKND 206 

acccctgcaccgcctccgaggaaggcccgxactcgaccatcggcgacgacgcccaagaca 2940 
L C ? A F S E GP YS T I GD DAQDT22 6 

cctacctctccacctrcgccggacccatcaccgcccgggrcaacgccaacctgccgggcg 30 00 
YLSTFAGP ITARVNANLPGA 246 

ccaaccrg.acegacgccgacacggrcgcgctgatggacct:ctgccccr.tcgagacggt:cg 30 50 
HLTDADTVAX MDI.CPFETVA 266 

cctcctcctcccccgacccggcaacggcggacgcggggggcggcaacgggcggccgctgr 3120 
SSSSDPATADAGGGKGRPLS 286 

cgcccttctgccgcctgttcagcgagtccgagrggcgcgcgtacgactacctgcagtcgg 3180 
PFCRLFS E SEWRAYD^LQSV 306 

tgggcaagtggtacgggtacgggccgggcaacccgctggggccgacgcagggggtcgggi: 3240 
GX. HTGYG-PGKP LGPT.QGVG.F 326 

tcgtcaacgagctgctLggcgcggctggccggggcccccgtgcgcgacggcaccagcacca 3300 
VKELLARLAGVPVRDGTSTN 346 

accgcaccctcgacggcgacccgcgcaccctcccgctcggccggcccctictacgccgact: 3360 
RT2.DGD P RTFP !• G R P I» ¥ A D F 366 

tcagccacgacaacgacatgatgggcgtLcctcggcgccctcggcgcctacgacggcgtcc 3 420 
SKDNDM«GVLGALGATDGVP 386 

cgcccctcgacaagaccgcccgccgcgacccggaagagctcggcgggtacgcggccagct 3480 
P LDKTARRD P EELGGrAASW 406 

gggccgtcccgttcgccgccaggatctacgtcgagaagatgcggtgcagcggcggcggcg 3540 
AVP FAAR ITVE KHRCSGGGG 426 

gcggcggcggcggcggcgaggggcggcaggagaaggatgaggagatggrcagggtgctgg 3 600 
GGGGGEGRQE KDEEMVRVLV446 

tgaacgaccgggtgatgacgctgaaggggtgcggcgccgacgagagggggatgtgtacgc 3 660 
KDRVMT LKGCGADERGMCTL 46o 

tagaacggctcatcgaaagcacggcgcttgcgagggggaacggcaagtgggatctctgct 3720 
ERFIESMAFARGKGKWDLCF486 

ttgcttgatatgcccacgcccgagattgaacagaacttgtgatgggggtagagrgtggta 3780 
A 457 

tccgagatgacagrccacagnttccgggaatcaaaaatcggntagactggcgaaactcaa 38 40 
gtctggggcctgcggcgcccgcatcctccgtcccctgrtigttacctccttaatggctctt 3900 
tcttattttttacctccctcaaattcccacacaaacctcctatcgtcticcccccctccct 3950 

txtcttcttctrgcacatcggacgggaattgticgac 
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l AGATTCAACGACC<L^ 50 

M V T 3 



1 TCTGA^i. i. kCCUrs-i Z u <-GGCGGCGTATC l Us. l i iLiwGgcgagzggccc 100 
LTrLLS A A Y L L S G IS 



10 1 ggacccaccgczcggacagggccgcggcgCwgaccccgaaacggagTAGA 150 

R 17 



151 GTGTCTGCGGCACCrAGTTCTGCTGGCTCCAAiTrCCTGCGATACGGTAGA 200 
VSAAPSSAGSKSCDT-VD 34 



201 CCTCGGGTACCA^GCTCCCCTGCCACTTCrCATCTATGGGGCCAGTACT 250 
LGYQCSPATSKLWGQYS 51 



2 S 1 CGCCA~CTTT7CGCrCGAGGACGAGCICTCCGTGTCGACTA«GCTTCCC 300 
PFFSLEDELSVSSKLP- 67 



30 L AAGGATtGCCGGATCAC C I l ACAGGTGCTATCGCGCCATGGAGCGCG 350 
KDCRITLVQVLSRKGAR 84 



351 GTACCCAACCAGCTCC^AGAGCAAAAAGTATAAGAAGCTTGTGACGGCGA 4 00 
YPTSSKS KKYK2CLVTAX 101 



401 TCCAGGCCAATGCCACCGACTTCAAjC^CA^ 4 50 

Q A M A T D F KGKFAFLKT 117 



451 TACAAC7ATACTCTGGGTGCGGATGACCTC?oCTCCCTTTGGGGAGCAGCA 5 00 
YNYTLCAO D LT P'FGEQQ 134 



501 GCTGGTGAACTCGGGC\TCAAGTTCTACCAGAGGTACAAGGCTCTGGCGC 5 50 
LVNSGI :<? YQRYKALAR 1S1 



551 CCAGTGTGG7GCLU 1 1 t. ATTCGCGCCTCAGGCTCGGACCGCGTTATTGCT 500 
SVV? FIR ASGSDRVIA lo7 
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601 7CGGGAGACAAGTTCATCGAGGC47rrCCAG^ fi 50 
SGE KFI EGFQQAKLA-D? 134 



651 TC<3CGCGACGAACCGCGCCGCTCCGGCGATTAGTGTGATTA~CCGGAGA 700 
GATNR AA?AISVIIPES 201 



7 Q 1 GCGAGACGTTCAACAATACGCrGGACCACGG tX3 TG rGCACCAACTTTGAG 7 S 0 
ETFNNTLD KGVCTKFS 217 



751 GCGAGTCAGCTGC<I^GATGAC<7rTC A 0 0 

A* S Q L G . D EVAAN FTAL FA 234 

B Q 1 ACCCGACATCCGAGCTCGCGCCGAGAAGCATCTTCCTGGCGTGACGCTGA 8 S 0 

PDIRARAEKHLPCVTLT 251 



851 CAGACGAGG A CG i TGTC ^^T 7 "^- ATGGACATGTGTTCGTTTGATAggzXa 90° 
DEDVVS LMDMCScDTV 267 



301 ccgcgc accagcg ACGCAA <rrrjkGcrcrrr &r rrrrrrrcrrcAACTCTrCAC 9S0 

ARTSDASQLSPFCQLFT 284 



951 TC ACAATT? AGTGG AAG AAGT A C1AACT ArCTTCACTTrCTTGGGCAAGT ACT 1QG0 
HNEWiCKY NYLQSLGKYY 301 



1001 ACGGCXACGGCGCAGGCAACCCTCTGGGACCGGCTCAGGGGATAGGGTTC 1050 
GYGAG MP LGPA-QG IGF 317 



105 1 ACCAACGAGCTGAT7GCCCGGTTGACTCGTTCGCCAGTGCAGGACCACAC 1100 
TNELI ARLTRSPVQDKT 334 



1101 CAGCACTAACrCGACrCTAGTCrCCAACCCGGCCACCTTCCCGTTGAACG 1150 
STHSTLVS NPATFP LNA 351 



1151 CTACCVrGTACGTCGA C l , , l C^CACGACAACA£C^TCCTTTCCVrCrTC 1200 
TMYVDFSKDNSMVSIF 357 



• -* 

1201 TTTGCATTGCCCCrGTAC 2 wACC^CACTGAACCCTCGTCCCCCAC 1250 
FALGLYNCTEPLSRTSV 384 
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• • w m 

1251 C^AAPJ^GCCAAGGAATTCGAT^^ 1300 
ESAKELDGYSASWVVPF 401 



• • # • • 

1301 TCGGCGCC<TGAGCCTACITCGAGACGATGCAATGCAA^ 1 3 S 0 

G A R A Y F ETMQC K S E KS 417 



1351 CClLl rG 1 4.CGCGC 1 1 i UVrrAATGACCGGGTTGTGCCACTGCATGGCTG 1400 

PLVRALINDRVVPLKGC 434 

• • • • 

1401 CGATGTGGACAAGCrGGGGCGATGCAAGCrGAATGA Li CTG I 'CA^GGGAT 1450 

DVOKLGRCKLNDFVKGL 451 

■ 

• • * • ■ 

1*51 TGAG 1 1 CiGGCCAGATCTGGGGGCAACIGGGGAGAGTG C i ITAG I lUAGAT 1500 

SW.ARSGGNWGECFS 4S5 



1S01 GTCAj. ^ j. i ATGCTATACTCCAATAGACCGTTGCTTAGCCATTCACTrO*, 1550 
1551 CXTTCCrCGA^CCGCCTGCCG . 1571 
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MGVFvVLLS. iacLzgsTSG TALGprg..M hsCCTSVDrG YQCFPELSHk 
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MVgriAIaSL csE SRPCDTpOlG FQCgTAISHF 

MG FL- VLL5L GYLL — VSAG PPVG N SKSCDTVDGG YQCFPEISHL 

MGVfWLLS. XATI*rCSTAG YALGPRG. .N SHSCDTVDGG YQCFFEISEL 
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A. cerrsus 9A-1 
A. cerreus cbs 
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MALT 3h In 3? 
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AOLPGVTLTO EDW-LMOMC PFETVARTS DATELSPFCA 

ADLPGVTLTD EDWYXWDMC PFETVARTS 
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LsTQeEWqaY 
LFSEsEWraY 



DYLQSL5KYY 
OYYQSLGKYY 
DYLQSVGKWY 



GYGAGSPLGP 
GnGGGNPLG? 
GYGPGNPLG? 



AQGIGFtNEL 
AQGVGFvNEL 
TQGVGFvNEL 



IARLTQS PVQ 
IARMTHSPVQ 
LARiAgvPVR 



LFTH-EW-OY DYLQSLGKYY GYGAGNPLGP AQGVGF-NEL IARLTRSPVQ 
LFTHDEWRQY DYLQSLGKYY GYGAGNPLGP AQGVGFANEL IARLTRSPVQ 



A. Carre us 9A-1 
A. cerreus cbs 



351 

DHTCVNNTLD 
OKTCVNNTLO 



ASPATFPLNA 
ANPATFPLNA 



TLYADFSaDS 
TLYAOFSHDS 



NLVSIFWALG 
NLV5IFWALG 



400 

LYNGTAPLSq 
LYNGTfcPLSc 



A. nig*er var. awamori 

A. niger T213 

A. nlqec NRRL3 135 



A. fumigacus 13073 

A. zumlgacus 32722 

A. fumtgacus 58128 

A. fumigacus 2690 6 

A. fumigatus 32239 

A. nldulans 

T. chermopiillus 

M, Chermophila 

Consensus 
Conphys 



ODTSSNHTLO 
ODTSSNHTLO 
0OTSSNHTLD 



SNPATFPLNS 
SNPATFPLNS 
SSPATFPLNS 



TLYADFSHDN 
TLYAOFSHDN 
TLYADFSHDN 



GIISILFALG 
GIISILFALG 
GIISILFALG 



LYNGTkPLST 
LYNGTkPLST 
LYNGTkPLST 



DHTSTNsTLv 
DHTSTNsTLv 
DHTSTNsTLv 
DHTSTNsTLv 
OKTSTNsTLD 



SNPATFPLNA 
SNPATFPLNA 
SNPATFPLNA 
SNPATFPLNA 
SDPATFPLNA 



TMYVDFSHDN 
TMYVDFSHDN 
TMYVDFSHDN 
TMYVDFSHDN 
TIYVDFSHDN 



SMVS I FFALG 
SMVSIFFALG 
SMVS I FFALG 
SMVSIFFALG 
GMIPIFFAMG 



LYNGTEPLSr 
LYNGTGPLSr 
LYNGTEPLSr 
LYNGTEPLSr 
LYMGTEPLSq 



ONTSTNHTLD 
DYTTVNHTLD 
OgTSTNRTLD 



SNPATFPLDr 
SNPATFPLNA 
GDPrTFPLGr 



KLYADFSHDN 
TLYADFSHDN 
PLYADFSHDN 



SMISIFFAMG 
TMTSIFaALG 
DMMGVLgALG 



LYNGTQPLSm 
LYNGTAkLST 
aYDGVPPLDK 



DHTSTNHTLD SNPATFPLNA TLYAOFSHDN SMI SI FFALG LYNGTAPLST 
DHTSTNHTLD SNPATFPLNA TLYADFSHDN SMI SI FFALG LYNGTAPLST 



A. Carr&us 9A-1 
A. Cerreus cbs 



A. nlger 
A. nlger 
A. niger 



var. 
T213 
NRRL3135 



A. fumigacus 13073 

A. fumigacus 32722 

A. fumigacus 58128 

A. fumigacus 26906 

A. fumigacus 32239 

A. ciidulans 

T. thermophilus 

M. checmoohlla 



Consensus 
Conpfaya 



401 ^° 

TSVESVSQTD GYAAAWTVPF AARAYVEMMQ C RAEKEP 

TTVEDITrTD GYAAAWTVPF AARAYIEMMQ C RAEKQP 

awamori TTVENITQTD GFSSAWTVPF ASR1YVEMMQ C QAEQE? 

TTVENITQTD GFSSAWTVPF ASR1YVEMMQ C QAEQE P 

TTVENITQTD GFSSAWTVPF ASR1YVEMMQ C QAEQE? 

TSVESaKElO GYSASWWPF GARAYFECMQ C KSEKEP 

TSVESaKElD GYSASWWPF GARAYFEtMQ C ? '• r f S ^f Z? 

TSVESaKElO GYSASWWPF GARAYFEtMQ C KSEKES 

TSVESaKElD GYSASWWPF GARAYFEtMQ C KSEKEP 

TSeESTKESN GYSASWAVPF GARAYFEtMQ C KSEKEP 

DSVESIQEmD GYAASWTVPF GARAYFELMQ C E.KKEP 

TEIKSIEETO GYSAAWTVPF GGRAYIEMMQ C DOSOE? 

TArrDpEEIG GYAASWAVPF AARiYVEKMR Csgggggggg gegrQEKDEe 

TSVESIEETO GYSASWTVPF GARAYVEMMQ C QAEKE? 

TSVESIEETD GYSASWTVPF GARAYVEMMQ C QAEKE P 
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A. c&rrsus 9A-1 
A. csrreus cbs 



451 

LVRVLVNDRV 
LVRVLVNDRV 



M?LHGC?TDK 
M?LHGCAVDN 



LGRCKrQAFV 
LGRCKrDDFV 



AGLSFAQAGG 
EGL5FARAGG 



S0O 

NWADCc 

NWAZCc — 



A. nig-er var. awaaiori 

A. nig-er T213 

A. niyer NRRL3135 



LVRVLVNDRV 
LVRVLVNDRV 
LVRVLVNDRV 



V?LKGC?I0a 
VELKGCPIDa 
VPLHGCFVDa 



LGRCTrDScV 
LGRCTrDScV 
LGRCTrDScV 



rGLSFARSGG 
rGLSFARSGG 
rGLSFARSGG 



A. r"iuni > g'acus 13073 

A. funiigacirs 32722 

A. jfujnigraeus S8128 

A. jfuoigacus 26906 

A. fumigattxs 32239 



LVRALINORV 
LVRALINORV 
LVRALINORV 
LVRALINORV 
•LVRALINDRV 



VPLKGCOVDK 
V?LHGCOVDK 
VPLHGCDVDK 
VPLKGCDVDK 
VPLHGCAVDK 



LGRCKLMDFV 
LGRCKLMDFV 
LGRCKLMOFV 
LGRCKLMOFV 
LGRCKLKDFV 



A. nldulans 

T. checmophilus 

M. thermophlla 

Consensus 
Conphya 



KGLSWARSGG 
KGLSWARSGG 
KGLSWARSGG 
KGLSWARSGG 
KGLSWARSGG 



LVRVLVNDRV 
WRVLVNDRV 
MVRVLVNDRV 



VPLHGCAVDK 
V-LHGCEVDS 
MTLJcGCGADE 



FGRCTLOOWV 
LGRCKrDDFV 
rGMCTLEr FI 



EGLNFARSGG 
rGLSFARcGG 
ESMAFARGNG 



DWAECsA 

DWAZCFA 

DWAECFA 



N*WG EC FS 

NWGZCFS 

NWGECFS 

NWGECFS 

NSEQSF5 

MVIJcTCFTl — 
NWEGCYAase 
KWD1CFA 



LVRVLVNDRV VPLHGCAVDK LGRCK-DDFV EGLSFARSGG NWAZCFA- 
LVRVLVNDRV VPI^iGCAVDK LGRCKRDDFV EGLSFARSGG NWAECFA 
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CP-1 

XAXAXGAAITCAXGGGCGXGXXCGXCGXGCXACXGXCCAXXGCCACCXXGXXCGGXXCCA 
1 + + ^ + + + g 0 

AT AT AC T T AAGT ACCCGC ACAAGCAGCACGATGAC AGGT AACGGTGGAACAAGC C AAGGT 

CAXCCGGXACCGCCXXGGGICCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 
61 + + + + + + 120 

GTAGGCCAXGGCGGAACCCAGGAGCACCAXTAAGAGX 
CP-2 

CP-3 

GTXACCAATGXXXCCCAGAAATXXCXCACXIGXGGGGXCAAIAGXCICCAXACXXCXCXX 
121 -r + + + + + 180 

CAAXGGXXACAAAGGGXCXXXAAAGAGXGAACACCCCAGTTATGAGAGGTATGAAGAGAA 

XGGAAGACGAAXCXGCXATXTCTCCAGACGXTCCAGACGACTGTAGAGTTACTTTCGXTC 

181 + h + + +• -+ 240 

ACCTTCTGCTTAGACGAXAAAGAGGXCXGCAAGGXCXGCXGACAXCXCAATGAAAGCAAG 

CP-4 

CP-5 

AAGTTTTGTCTAGACACGGTGCXAGAXACCCAACTTCXTCTAAGXCTAAGGCXXACXCXG 
241 + + + + + + 300 

TXGAAAACAGATCTGTGCCACGAXCXArGGGXXGAAGAAGATTCAGATTCCGAATGAGAC 

CXXTGAXTGAAGCTATTGAAAAGAACGCXACIGCXTTCAAGGGTAAGTACGCTTTCTTGA 
301 + + + + + +. 36Q 

GAAACTAACTTCGATAAGTTXTCTTGCGAXGACGAAAGXXCCCAXXCAXGCGAAAGAACX 

CP-6 

CP-7 

AGACTTACAACTACACTTTGGGTGCXGACGACXTGACXCCAXXCGGXGAAAAGCAAAXGG 
361 + + + + H -h 4 20 



XXAACXCXGGTAXTAAGXXCXACAGAAGATACAAGGCXXXGGCXAGAAAGAXIGXTCCAT 

421 + + + + + . + 4 80 

AAXTGAGACCATAATTCAAGATGXCTTCTATGTTCCGAAACCGAICXTXCTAACAAGGTA 

CP-8 

CP-9 

TCATTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCXGAAAAGXICAXXGAAGGIX 

481 + + + + + 540 

AGXAAXCXCGAAGACCAAGACXGXCXCAAXAACGAAGACGACXTXXCAAGXAACXXCCAA 
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XCCAAXCXGCTAAGXXGGCXGACCCAGGXXCTCAACCACACCAAGCXXCXCCAGXTAXXG 
541 + + + + + + 600 

AGG TT AG ACGATTCAACCGAC TGGGTCO AAGAGT TGGT GTGGT T CGAAGAGGTCAATAAC 

CP-10 

CP-11 

ACGXTATTATTCCAGAAGGaTCcGGTTACAACAACACTTTGGACCACGGTACTTGTACXG 
601 + + + -r + -r 660 

TGCAAIAAJ!AAGGXCXXCCtAGgCCAAIGXXGXXGXGAAACCXGGXGCCAXGAACAIGAC 

CXXXCGAAGACXCTGAA2TGGGXGACGACGXXGAAGCTAACXXCACXGCXXXGXICGCXC 
661 + + + h +■ 720 

GAAAGCTXCTGAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAG 

CP-12 

CAGCTAXTAGAGCXAGATTGGAAGCTGACTTGCCAGGTGTTACTXTGACTGACGAAGACG 
721 + + + + + +. 780 

GTCGAIAATCTCGATCTAACCTTCGACTGAACGGTCCACAAXGAAACTGACTGCTXCTGC 
CP-13 

XXGXXXACXXGAXGGACATGTGXCCAXXCGAAACXGXXGCXAGAACXXCXGACGCXACXG 
781 + + + + + + 840 

AACAAAXGAACTACCXGTACACAGGTAAGCTTTGACAACGATCTTGAAGACTGCGATGAC 

AAXXGXCXCCATTCXGXGCTTXGTXCACXCACGACGAATGGAGACAAXACGACTACXXGC 

841 + + -j- + + + 900 

TTAACAGAGGXAAGACACGAAACAAGXGAGXGCXGCXXACCXCXGXTAXGCIGATGAACG 
CP-14 

CP-15 

AATCXXXGGGXAAGXACXACGGXTACGGXGCXGGXAACCCAITGGGXCCAGCXCAAGGXG 
901 + + + + + +. 960 

XTAGAAACCCAXXCAXGAXGCCAAIGCCACGACCATTGGGTAACCCAGGTCGAGTTCCAC 

XXGGXXTCGCXAACGAATXGATXGCTAGAXIGACTAGATCTCCAGXTCAAGACCACACXT 

961 + + -i- + + + 1020 

AACCAAAGCGAXXGCXTAACTAACGATCTAACXGAICXAGAGGTCAAGXXCXGGXGXGAA 

CP-16 

CP-17 

CTACTAACCACACTTTGGACXCTAACCCAGCTACXXXCCCATXGAACGCXACXXXGXACG 

1021 + + + + t* + 1080 

GAIGATXGGXGX GAAACCXGAGAXXGGGXCGAIGAAAGGGTAACT TGCGATGAAAC AXGC 
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CXGACXXCXCXCACGACAACTCXAXGAXXXCTAXXXXCIXCGCXITGGGTTTGTACAACG 
1031 + -r -r + + +■ 1140 

^CTGAJVGAGAGTGCTGTTGMATACTAAAGAIAA 

CP-13 

CP-19 

GTACTGCTCCATTGTCTACTACTTCTG7TGAAXCTAXXG 
1141 + + + + + + 1200 

CATGACGAGGTAACAGAXGAIGAAGACAACXTJ^ 

CXXCXXGGACXGXXCCAXXCGGXGCT1AGAGCXTACGXXGAAAXGAXGCAAXGXCAAGCTG 
1201 + + + + + + i2S0 

GAAGAACCTGACAAGGTAAGCCACGATCTCGAATGCAACTTTACTACGTTACAGTTCGAC 

CP-20 

CP-21 

AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTXGXXCCATXGCACGGTXGIG 
1261 - : + + + + + ^ 1320 

XXXXCCXXGGTAAGCAAXCXCAAAACCAAITGCXGXCXCAACAAGGTAACGXGCCAAGAC 

CXGXXGAGAAGXTGGGTAGAXGXAAGAGAGAGGAGXXCGXXGAAGGITXGXCTXXCGCXA 
1321 * + + + + + 1380 

GACAACXGTTCAACCCATCTACATTCTCTCTGCTGAAGCAJVCTTCCAAACAGAAAGCGAX 

CP-22 

GAXCXGGTGGTAACTGGGCTGAATGTTTCGCT TAAGAAXTCATATA 
1381 + + + + 1426 

CIAGACCACCAXXGACCCGACXXACAAAGCGAAXTCXTAAGXAXAX 
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1 ^^^^^CGATAGCGGACCGACrAGGCATCGTTCATCCACAATATCTCA SO 
XQl GGCCGTCrAGwTCGGGCrCCCGGGATCAGGAGGAGC^GGCT CU tU i 1 iC AT ISO 



151 TTCGwiCAIGG^l I i. t UC^CG G TCGClCiVi X GCT^ 200 



MAFFTVALSLYYLLS 



" • • 

25 1 ccgcacagAGTCTCTGCTe^CGCCCC^ 

VSAQ APVV QNHSCN 



Qvspy FSIEQESAISE 



EAIQKMAT-S FWGQYAF 



L £ S Y N V T L G ADDLTIFG 



15 



- ■ • • . 

201 CC^Ggcgagacccccacaacacccgcccgcc cage ccaac cggcacccac 250 
R 



IS 



300 
30 



301 TACGGCGGACGGrGGATATCAATGCTTCCCCAArG CCTCXCAroTTTGGG 350 
TADGGYQCFPKVSKVWG 



47 



351 GTCAGTACTCGCCGTACXTCTCCATCGAGCAGGAGTCAGCTATCrC^ 400 



63 



4 a 1 GAGGTGCCTCATGGCTGTCAgGTTAC L. I i. 11* CG CAGG TGC TC ICU-UiCA 450 

DV PKGCEVTFVQVLSRK 80 

GARYPTES KS KAYSGLI 37 

501 TTGAAGCAMTCCAGAAGAATGCTACCrCTTTITGG S50 



113 



• .... 
CTGGAG; ^^ATAACrAT^ g o 0 



130 



"2. CCL*GAACCAGATGGrra\rTC^ g S 0 

SNQMVOSGAKFYRRYKN 147 

_ * 

5=1 ATCTCGCCAGGAAA^TA^ 700 

LARJCNT? F I RASGS DR 1£3 
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70 1 CTCGTTKCTCTGa^!^^ 7S 0 

VVA SAE K T I N G F R K A Q L 130 

• • • • • 

75 1 CCACGACCATGGCTCCAAACGTGCTACGCCA G I i.G I'CAATGTGATTATCC 800 

KOKGSKRATPV VNVI I? 137 

i 

• • • a • 

E I D W $ tX T L D KST CVS 213 



831 TTLT^GJU^XGATGACKXX^ 900 
FEND E' RAD £ X E A N F T A I 230 



901 TATCGGACCTCCGATCCGCAAACGTCTGGAAAATGACCrCCCroGCATCA 9 S 0 

MGPPrRKRLENDLPGIK 247 

• • • • * 

9 S t AACITACAAACGAGAATGTAATATAT^ 1000 

LTNENVIYX.MOM CS FD 253 



1001 ACCATCrr^^rCjCJ^Ct^rCCACGGAA.CCGAGC'i I * i C i CCATTTTglt^VT 1050 
TMARTAKGTELS P FCAI 280 



• • • * • 

1051 CTTCAC^AAAAGGAGraS^ 1100 

ftekewlqydylqsl.sk: 297 

• • — • • • 

1101 £SSA£^KGCTACGGTGCCC^^ 1150 

YYGYCACS PLGPAQGI 313 



• • • • * 

1151 GGCTTCACCAAaSACXTTGA^ 1200 
G FTNEt lARLTQSPVQD 330 



1201 CAACACAAGCACCAACCACACTCTAGACTCGAACCCAGCCACATrrCC^ 12 SO 
NTSTNHTLDSN9ATFPL 347 



• • • • * 

12 S 1 TCGACAGGAAGCTCTACGCCGA L E .1 LC CCACGACAArAGCATGArATCG 1 3 Q 0 
D R K L Y AD F. S K 0 „S M ^1 S 353 

- 

1301 ATATTCT7CGCC\TGC<?TCT^^ ^50 
I FFAMGLYNGTQ9LSMD 330 
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1351 TTCCGrGGASTCCATCCAGGAG ATGGACGGTTACGCGG CU i L a. tU GACTG 1400 
S V E S I Q EM DG.YAASWTV 357 



1401 TTCCCi - a. LLrLx lUCGAGGCCXTA^ j. x LT^dGCTCAXGCAGTGGGAGAAGAAG 1450 
P rGARAYFELMQCEcCK 413 



1431 CAGCCC L i id I G CGGGTATTAGTGAATGATCGC C Z rGTXCCTCI 1C ATGC 1500 
EPLVRVLVNDRVVPI-HG 43 0 



1501 CTGCGCAGTTGACAA G Z I rG GACGGTCCA CI 1 P5 GACG AT xA GACC 1550 
CAVDKFG°'RCTLDDWVSG 447 



1551 Cl 1 iGA Ax n ZC CAAGGAGCGGCGGGAACTOGAAGA C rTGTTX 1 A CCCTA 1600 
LNFARSGCNWKTCFTL 463 



1501 TAAAGGGLU iuwLi LAZTCAXAAk EG LTG L^CACGTATAGGAAGCTTAG 1650 

• • ■ • • 

IS 51 GGAATT AG C iuu IG G C j. I xA CTCZTATTAGACCAAGAAZG Aa. 1 i.G x TIG 1700 
. » • » • « 

17 0 1 ITClTAAGGCCrrC^AGCAXATCGTCA^ 1750 

• • • • • 

175 1 CA1L, aACCTGAACCCCw ^wCATCAACLxUi. lUxO l I'lCAGAGTAG 18 00 

• • • • • 

1801 TrrCACCARACATATCCT CG ' iG ' i 'CCTCr L ITC iUC TCTTCGwTCTCATAT 13 SO 

• • • • • 

18 S 1 TACAL iUw iUi.L i ATCTATATCGTCAACAAAACTACCACCCAAACACCAA 1900 

• • • 

190 1 ATGTCACACrTTCCAGGACGAAATTTCTTCG 1931 
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1 AICQGCGI CrCltiC l unWAUim.UTlX^^ 
23 MCVS A VLLP L Y L L S GVTS G L 



61 GCAGTCCCCGCCrCGAGAMTCAATCCAGTTGCQATACGGTCGATCAGGGGIAICAATGC 
AVPASRNQSS CDTVDQOYQC 
-1 *1 

121 XTCICCGAGACTTCGC ATCTTTC GGGTCAA1ACGCACC G T TC 
18 FSETS HLWGQYAPFFSLANE 



181 TCGGTCATCTCCCCTQAGGTGCCCGCrGGATGCAGA^ 
38 SVISPBVPAGCRVTFAQVLS 



24l CGTCATGQAQCGCGGrATCCGACCQACTCCAAGGGCAAGAAATACTCCGCTCTCATTQAG 
58 RRGAB YPTD S K G K K Y SAL I E 



301 GAGATCCAGCAGAACGCGACCACCTTTGACGGAAAATATGCCTTCCTGAAGACATACAAC 
78 EZQQN.ATTF D GKY A FLKT Y H 



361 TACAQCTIt3GQTQCAGA!IGACCIGACTO 
98YSLGADDI.TPFGBQBX.VNSO 



421 ATCAAGrrCTACCAQCGGTACOAATCGCTGACAAGOAACATCQTTCCATIGATCCGATCC 
118 IKFYQRYES LTRNZVPFZRS 



481 TCIG0CTCCAGO»CCnCUiTC0CClCCaG0AAGA^ 

138 SGSSRVZASGKKFZEGFQST 



541 MGCTGAAGOATarrCOTGCCCAGCCCGGCCAATC 

158 KLK'DFRAQP GQSSPKZDVVZ 



601 TCCGAGGCCACXTTCATCCAACAACACTtTrc 

178 SEASSSNNTLDPGTCTVFED 



661 AGCGAATTGC^GATACCGTCGAAGCCAATTTCAC 

198 S E L A D*s£T VBAKFTATFVPSZR 



721 CAACGTCTGGAG^CGACCTGTCCGGTGTGACTCTCACAGACACAQAAGTOACCTACCTC 
218 QRLBND LSGVTLTDTEVTYL 



781 AIXj^CAlU'lUJlCCll C QACACCATCTCCACCAGCACCGTCGACACCAAGCTGTCCCCC 
238 MDKCSFDTZSTSTVDTKLSP 



Fig. 11A 
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84l TTCTGTCACCTCnTC^CCCA 

258 FCDLFTHDEWI NYDYLQSLK 



901 AAGTATTACGGCCATGGTGCAGGTAACCCGCTCGCX^^ 

278 KYYGHGACNPLGPTQCVGYA 



961 MCGAGCTCATCGCCCGTCTGACCC^ 

298 NE LIARLTHSFVHDDTSSNH 



1021 ACTTTGGACTCGAGCCCGGCTACCTITCCGCTCAACTCTACTC^ 
318 TLDSS ?AT FPI*NST I*YADFS 



1081 CATGACAACGGCATCATCTCGATTCTCTT^ 
338 HDNGI X S X L F A L G L " Y * N GTK P 



ll4l CTATCTACCACGACCGIXK3AGAATATCACCCAGACAGATC 
358 LSTTTVENITQTDGFSSAWT 



1201 GITCCG TITUCTI XX ^ 
378 VFFASRLYVEMMQCQAEQEP 



1261 CTGKaTCCCflXjUVlUl j ^^^ 
398 LVRVLVNDRVVPLHGCPVDA 



1321 TIXKSGGAGATGTACCCWGGATAGCTrTGr^ 
418 LGRCTRDSFVRGLS FARSGG 



1381 GArnXSXJGGAGTGTTTTGCTTAG 
438 DWAECFA* 
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tctagaacaataacaggtactccctaggtacecgaaggaccttgtggaaaatgtatggag 60 

gtggacacggcaccaaccaccacccgcgatggcgcacgtggtgccctaaccccttgctcc 120 

ctcaggatggaatccatgtcgactctttaccctcaceafccgcctggatgaaacctccccg 180 

ctaagctcacgacgatcgctatttccgaccgatttgaccgtcatggtggagggctgattc 240 

ggtcgatgctcctgccttcatttcggagt tcggagacatgaaaggcttatatgaggacgt 300 

cccaggtcggggacgaaatccgccctgggctgtgctccttcgtcggaaacatctgctgtc 360 

cgtgatggctaccatgggctttcttgccattgtgctctccgtcgccttgctctttagaag 420 

MGF LA I V L ' S V A L L F R S 16 

gtatgcacccctctacgtccaattctctgggcactgacaacggcgcagcacatcgggcac 480 

T S G T 20 

cccgttgggcccccggggcaaacatagcgactgcaactcagtcgaticacggctatcaatg 540 

PLGPRGKHSDCNSVDHGYQC 40 

ctttcctgaactctctcataaatggggactctacgcgccctacttctccctccaggacga 600 

F P ELSHKWGLYAP Y F S L Q D E 60 

* 

gtctccgt/ttcctctggacgtcccagaggaetgtcacatcaccttcgtgcaggtgct.ggc 6 60 

SP FPLDVPE DCHITFVQVL A 80 

■ 

c cgcca cggcgcgcggagc cc a a cccatagca agaccaaggcgt acgcggc gac ca 1 1 gc 720 
RHGARSPTHSKTKAYAATIA 100 

ggccatccagaagagtgccactgcgt tt ccgggcaaatacgcgttcctgcagtcat at aa 780 
AIQXSATAFPGKYAFLQSYN 120 

ctactccttggactctgaggagctgactcccfctcgggcggaaccagctgcgagatctggg 840 
YSLDSEELTPFGRNQLRDLG 140 

cgcccagttctacgagcgctacaacgccctcacccgacacatcaacccct tcgtccgcgc 900 
AQFYERYNALTRHINPFVRA 160 

caccgafcgcafccccgcgtccacgaatccgccgagaagtfccgfccgagggcttccaaaccgc 960 
TDASRVKESAEKFVEGFQTA 180 

tcgacaggacgatcatcacgccaatccccaccagccfctcgcctcgcgtggacgtggccat 1020 
RQDDHHANP HQ P SPRVDVA I 200 

ccccgaaggcagcgcctacaacaacacgctggagcacagcctctgcaccgcct t cgaatc 1080 
PEGSAYKHTLEHSXCT AF ES 220 

cagcaccgtcggcgacgacgcggtcgccaacttcaccgccgtgttcgcgccggcgatcgc 1140 
S TVGDDAVANF TAVFAP AI A 240 

ccagcgcctggaggccgatcttcccggcgtgcagctgtccaccgacgacgtggtcaacct 1200 
QRLEADLPGV.Q LS TDDVVNL 260 

gatggccatgtgtccgttcgagacggtcagcctgaccgacgacgcgcacacgctgtcgcc 12 60 
MAMCPFETVS LT0DAHTLSP 280 

gttctgcgacct ettcacggccactgagtggacgcagtacaactacctgct ctcgctgga 132 0 
FCDLFTATEWTQYNYLLSLD 300 

caagtactacggctacggcgggggcaatccgctgggtccggtgcagggggtcggctgggc 138 0 
KYYGYGGGNP LGPVQGVGWA 320 

gaacgagctgatggcgcggctaacgcgcgcccccgtgcacgaccacacetgcgtcaacaa 1440 
HELMARLTRAP VHDHTCVNN 340 

caccctcgacgcgagtccggccaccttcccgctgaacgccaccctctacgccgacttctc 1500 
TLDASPATFPLNATLYADFS 360 
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ccacgacagcaacctggt.grcgatcttietgggcgctgggcctgtacaacggcaccgcgcc 1560 
HDSNLVSIFWALG2-.YNGT.AP 380 

get gtcgcagacctccgtcgagagcgtct cccagacggacgggt acgccgccgcctggac 1620 
LSQTSVESVSQTDGYAAAWT 400 

ggtgccgt tcgccgctcgcgcgt aegt cgagatgatgcagtgtcgcgccgagaaggagcc 1680 
VPFAARAYVEMMQCRAEKEP 420 

gctggtgcgcgtgctggtcaacgaccgggtcatgccgctgcatggctgccctacggacaa 1740 
IiVRVLVNDRVMP IiHGCPTDK 440 

gctggggcggtgcaagcgggacgctttcgtcgcggggct gagctttgcgcaggcgggcgg 1800 
LGRCKRDAFVAGIj SFAQAGG 460 

gaactgggcggattgtttctgatgttgagaagaaaggtagatagataggtagtacatatg 18 60 
N W A D C F 466 

gattgctcggctctgggtcgttgcccacaatgcatattacgcccgtcaactgccttgcgc 1920 
catccacctctcaccctggacgcaaccgagcggtctaccctgcacacggettccaccgcg 1980 
acgcgcacggataaggcgcttttgttacggggttggggctgggggcagccggagccggag 2040 
agagagaccagcgtgaaaaacgacagaacat:agat:at:caattcgacgccaatt.catgcag 2100 
agtagtatacagacgaactgaaacaaacacat:cactt:ccctcgc"tcct:ctcctgtagaag 2160 
acgc.tcccaccagccgcttcrggcccttattcccgtacgctaggtagaccagtcagccag 2220 
acgcatgcctcacaagaacgggggcgggggacacactccgctcgtacagcacccacgacg 2260 
tgtacaggaaaaccggcagcgccacaatcgtcgagagccatctgcag 2327 
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^ 3 - GACTGCGTACAAAGGCTACT ATGCCTTCTTGAAAGACTACAGATACCAGC 700 

* AYKGYYAFLKDYRYQL 120 

701 TGGGAGOGAACGACCTGAOGCCCTTTGGGGAAAACCAGATGATCCACTTG 750 

GANDLTOFGENQMIQL i3S 
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7 S 1 CGCATC^AGTTITAT^CCA— 800 
CXKFYNKYKSZ.ARNAV9 153 



801 ATTCGTTCGTTGCTCCGGCTCTGATCGGGTCATIGCCTCGGw 8 50 

FVRCSGSDRVZASG'RLF 170 



SSI TCATCGAAGCTTrCCAGAGCGCCAAAGTGCTC<^^ 500 
ZEGFQSAKVLDPK.SDK 135 



901 C^ra^CCCTCCTCCC^CGATC^^ 950 
KDAPPTZNVZ IEEGPSY 203 



9S1 CAATAACACGCTCGACACCGCCAGCTGTCCAGT L' L ' i ' i b AGGACAGCAGCG 1000 

NNTLDTGSC PVFEDSSG 220 

• • • • • 

1001 GGGGACATCACGCACAGGAAAACTrrra^ 10 so 

G K 0 A Q E KFAKQFAPAI 235 



1051 CTGGAAAAC^TCAAGGACCATCrrCCCGGCGTGGACCTGGCCGTGTCGGA 1100 
L E KI*KD KL P GVD LAVSD 2S3 



1101 TGTACCGTACTTGATGGAC 1 1G i G lCC L. l i. iCJ A GACCTTGGCTCGCAACC 1150 
•VPYLKDLC9 FETLARKK 270 



HS1 ACA&AGACACGCTGTCTCC G L tl L (jCGCTCTTTCC^C O f*AAG AGGAGTGG 1200 
TDTLS P FCALSTQ EEW 285 



1201 CAAGCATATGACTACTACCAAACTCT 1250 
QAYDYYQSLGKYYG tfGG 303 



125 1 GGGTAACCCGTTGGGGCCAGCCCAAGGCGTGGGGTITGTCAACGAGTTGA 1300 
GNPLGPAQGVGFVtrZLl 320 



1301 TTGCTCGCATGACCCATAGCCCTGTCQAGGACTACACC^CGGTCAACCAC 13 50 
ARMTKSPVQDYTTVtfK 3 35 



1351 ACTCrTGACrCGAATCCGGCGAC^TTCCCrrrGAACGCGACGCTGTACGC 14 00 
TLDStfPATF P L M A T I* Y A 353 
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140 1 AGATtTCAGCCACGACAACACAATCAOGTCAATTTTO 14 SO 

OFSHDNTMTS EFAALGL 370 

• 

14 SI TGTACAACEGGACCGO^^^ IS 00 

VHCTAKLSTTSIKSIE 33a 



1501 GAGACGGACGGCTACTCGGCGGOGTGGACCG7TCGGTTCGGGGGGCGAGC IS S 0 
ETDGYSAAWTV? rGGRA 403 

IS S 1 CTATATCCAGATT3AXGCAGTGTGA 1500 
YIEMMQCD 0SO£ PVVRV 420 

_ • 

1SQ1 'TGCTGGTCAAGGACCGGGTGGTGCCACTGCAXGGCT Iff 50 

LVNDRVVPLHGCEVDS 435 

* 

i 

1&S1 CaXXX3CGATGCAAACGACACX2ACI^^ 1700 
LGRCXRDDFVRCLSFAR 4S3 



1701 ACAGGGTGGGAACrGGGACGGGTGTTAOG C XKsL i rCTGAGTAGGTTTATT 17 S 0 
QGGMWEGCTAASE- 4S6 
• • 

17 s i CAca^TrrrcGAccTTTcrATccrrcAAAc^^ is o o 

1301 C^TGAAATCXTTAACAGCXrCrc^^ 18 4 S 
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AAGCTT GGGCAAACTCATCATGCTCATCTTGATGATTCCACTGTTCA 
80 

Hindlll MLI LMI PLFSYLAAASL 

CTTTGCCCCTCTCTCGATGTTAAAATACTAA 
160 

RVLSPQPVSCD 

GCCCGGAGCTTGGTTACCAATGCGACCAGCAGACAACGCACACCrGGGGTCAATACT 
240 

S PELGYQCDQQTTHTWG'QYSPFFSVPS 

GAGATCTCCCCTTCOSTTCCTGATGGCTGCCXrc^ 
320 

EISPSVPDGCRLTFAQVLSRHGARFPT 

CCCGGGTAAATCCGCCGCCATCTCCGCTGTCCTCACCAAAATCAAAACCTCTGCC^ 
400 

PGKAA AISAVLTKI KTSATWYGSDFQ 

TCATCAAGAACTACGACTATGTACTTGGCGTAGACCACCrGACCGCGT 
480 

FIKNYDYVLGVDHLTAFGEQEMVNSGI 

AAGTTCTACCAGCGCTACTCCTCCCTCATCCAGACAGAAGACTCGGATACGCTC 
560 

KFYQRYSSLI QTEDSDTLPFVRASGQE 

ACGCGTC^TCGCCTCCGCCGAGAACTTCACCACCGGCTTCT^ 
640 

RVIASAENFTTGFYSALSADKKPPSS 

TACCAAGACCAGAAATGGTCATCATTTCTGAGGAGCCAACAGCCAAC 
720 

LPRPEMVI I SEEPTANNTMHHGLCRSF 

GAAGATTCCACCACCGGCGACCAAGCCCAAGCGGAATTCATCGCCC^ 
800 

EDSTTGDQAQAEFIAATFPP ITARLNA 

CCAAGGTTTCAAAGGCGTCACCCTCTCCAACACCGACGTCCTATCACrAATGGACCT 
880 

QGFKGVTLSNTDVLSLMDLCPFDTVA 

ACCCCCTTTCCrCCCTCACCACCACCTCTTCCG^ 
960 

YPLSSLTTTSSVSGGGKLS PFCSLFTA 

AGCGACTGGACAATCTACGATTACCTCCAGTCCCTAGGGAAATACTACGGTTTCG 
1040 

SDWTIYDYLQSLGKYYGFGPGNSLAAT 

CCAGGGGGTAGGGTAOTTCAACX^GCTTATCGCCCGCrTGAT 
1120 

QGVGYVNELIARLIRAPVVDHTTTNS 

CTCTTGATGGCGACGAAAAAACGTTTCCGTTGAACAGAACGGTCT 
1200 

TLDGDEKTFPLNRTVYADFSHDMDMMN 
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ATCCTGACTGCTTTGCGGATA^^ 
1280 

IIjTALRIFEHISPMDNTTIPTNYGQTG 

AGATGACGGGGTGAAGGAAAGGGATTTGTTCAAGGTTAGTTGGGCGGTC 
1360 



DDGVKERDLPKVSWAVP FAGRVY 



P E K 



TGGTTTGTGATGCGGATGGGGATGGCAAGATTGATAGTGATGAGGCTC^ 
1400 

MVCDADGDGKIDSDEAQKELVRILVND 

CGGGTGATGAGArraAATGGGTGTGATGCTGATGAA 
1520 

RVMRLNGCDADEQGRCGLEKFVESMEF 

TGCGAGGAGAGGGGGGGAGTGGGAGGAGAGGTGTTTTGTTTAG OT 

ARRGGEWEERCFV JCbal 
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Peniophora numbers l 

Alignment numbers 1 

P_involtus_Al ML FGFVALACLL 

P_involtus_A2 MH LGFVTI1ACI1I 

T_pubescens MAFSILASLL 

Ajpediades MSLFIGGCLL 

P_lycii MV SSAFAPSILL 

A_fumigatus MVTL TFLLSAAYLL 

consphyA MGVF WLLSIATLF 

A_nidulans MAFF TVALSLYYLL 

A_f icuum_NRRL3135 MGVS AVLLPLYLLS 

A_terreus MGFL AIVLSVALLF 

T_thermo MSLL LLVLSGGLVA 

T_lanuginosa MAGIGLGS FL VLLLQFSALL 

M_thermophila MTGL GVMWMVGFL 

C_f oecundissimum ML ILMIPLFSYL 
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50 

SLSEVLATSV P KNT APTFPIPESE 

HLSEVFAASV P RNI APKFSIPESE 

FVCYAYARAV PRAHIPLRDT SACLDVTRDV 

VFLQASAYGG WQATFVQPF FPPQI 

SLMSSLALST QFSF V AAQLPIPAQN 

.SGRVSAAPS SAGSKSCDTV DLGYQCSPAT 
GSTSGTALGP RGNSHSCDTV DGGYQCFPEI 
. . SRVSAQAP WQNHSCNTA DGGYQCFPNV 
GVTS GLAVPA SRNQSSCDTV DQGYQCFSET 
RSTSGTPLGP RGKHSDCNSV DHGYQCFPEL 
LYVS. . .RNP HVDSHSCNTV EGGYQCRPEI 

TASPAIPPFW RKKHPNVD I 

AIASL QSESRPCDTP DLGFQCGTAI 

AAASL RVLSPSCDSP ELGYQCDQQT 

QPV 



P_invo 1 1 u s _A1 
P_involtus_A2 
T_pubescens 
A_jpediades 
P_lycii 
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A — nidulans 
A_f i cuum_NRRL3 135 
A_terreuB 
T_thermo 
T_lanuginosa 
M_thermophi 1 a 
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QRNWSPYSPY 
QRNWSPYSPY 
QQSWSMYSPY 
QDSWAAYTPY 
TSNWGPYDPF 
SHLWGQYSPF 
SHLWGQYSPY 
SHVWGQYSPY 
SHLWGQYAPF 
SHKWGLYAPY 
SHSWGQYSPF 
ARHWGQYSPF 
SHFWGQYSPY 
THTWGQYSPF 



FPLAEYKA. . 
FPLAEYKA. . 
FPAATYVA . . 
YPVQAYTP . . 
FPVEPYAA. . 
FSLEDELSVS 
FSLEDESAIS 
FSIEQESAJS 
FSLANESVIS 
FSLQDESPFP 
FSLADQSEIS 
FSLAEVSEIS 
FSVP . . SELD 
FSVP SETS 



. . PPAGCQIN 
. . PPAGCEIN 
. . PPASCQIN 
. . PPKDCKIT 
. .PPEGCTVT 
SKLPKDCRIT 
PDVPDDCRVT 
EDVPHGCEVT 
PEVPAGCRVT 
LD VP ED CHIT 
PDVPQNCKIT 
PAVPKGCRVE 
ASIPDDCEVT 
PSVPDGCRLT 



QVNIIQRHGA 
QVNIIQRHGA 
QVHIIQRHGA 
QVNIIQRHGA 
QVNLIQRHGA 
LVQVLSRHGA 
FVQVLSRHGA 
FVQVLSRHGA 
FAQVLSRHGA 
FVQVLARHGA 
FVQLLSRHGA 
FVQVLSRHGA 
FAQVLSRHGA 
FAQVLSRHGA 



83 
100 

RFPTSGATTR 
RFPTSGAATR 
RFPTSGAAKR 
RFPTSGAGTR 
RWPTSGARSR 
RYPTSSKSKK 
RYPTSSKSKA 
RYPTESKSKA 
RYPTDSKGKK 
RSPTHSKTKA 
RYPTSSKTEL 
RYPTAHKSEV 
RAPTLKRAAS 
RFPTPGKAAA 
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P_invol tus_A2 
T_pubescens 
Ajpediades 
P_lycii 
A_fumigatus 
consphyA 
A_nidulans 
A_f i cuum_NRRL 3135 
A_terreus 
T_thermo 
T_lanuginosa 
M_thermophila 
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IKAGLTKLQG 
IKAGLSKLQS 
IQTAVAKLKA 
IQAAVKKLQS 
QVAAVAKIQM 
YKKLVTAIQA 
YSALIEAIQK 
YSGLIEAIQK 
YSALIEEIQQ 
YAATIAAIQK 
YSQLISRIQK 
YAELLQRIQD 
YVDLIDRIHH 
ISAVLTKIKT 



VQNFTDAKFN 
VQNFTDPKFD 
ASNYTDPLLA 
AKTYTDPRLD 
ARPFTDPKYE 
NATDFKGKFA 
NATAFKGKYA 
NATS FWGQ YA 
NATTFDGKYA 
SATAFPGKYA 
TATAYKGYYA 
TATEFKGDFA 
GAISYGPGYE 
SATWYGSDFQ 



FIKSFKYDLG 
FIKSFTYDLG 
FVTNYTYSLG 
FLTNYTYTLG 
FLNDFVYKFG 
FLKTYNYTLG 
FLKTYNYTLG 
FLESYNYTLG 
FLKTYNYSLG 
FLQSYNYSLD 
FLKDYRYQLG 
FLRDYAYHLG 
FLRTYDYTLG 
FIKNYDYVLG 



NSDLVPFGAA 
TSDLVPFGAA 
QDSLVELGAT 
HDDLVPFGAL 
VADLLPFGAN 
ADDLTPFGEQ 
ADDLTPFGEN 
ADDLTIFGEN 
ADDLTPFGEQ 
SEELTPFGRN 
ANDLTPFGEN 
ADNLTRFGEE 
ADELTRTGQQ 
VDHLTAFGEQ 
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150 

QSFDAGQEAF 
QSFDAGLEVF 
QSSEAGQEAF 
QSSQAGEETF 
QSHQTGTDMY 
QLVNSG I KFY 
QMVNSGIKFY 
QMVDSGAKFY 
ELVNSGIKFY 
QLRDLGAQFY 
QMIQLGIKFY 
QMMESGRQFY 
QMVNSGIKFY 
EMVNSGIKFY 



P_invol tus__Al 
P involtus A2 



134 176 
151 200 

ARYSKLVSKN NLPFIRADGS DRWDSATNW TAGFASA SHNTVQ 

ARYSKLVSSD NLPFIRSDGS DRWDTATNW TAGFASA SRNAIQ 
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QRYSFLVSKE 
TRYSTLFEGG 
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NHYKSL.ARN 
HRYREQ . ARE 
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ELPFVRASGS DRWATANNW 
NLPFVRASSS NRWDSATNW 
DVPFVRAAGD QRWDSSTNW 
WPFIRASGS DRVXAS GEKF 
IVPFIRASGS DRVIASAEKF 
NTPFIRASGS DRWASAEKF 
IVPFIRSSGS SRVTASGKKF 
INPFVRATDA SRVHESAEKF 
AVPFVRCSGS DRVTASGRLF 
IVPFVRAAGS ARVTASAEFF 
SIPFVRTAGQ DRWHSAENF 
TLPFVRASGQ ERVIASAENF 



TAGFAXiA . . . 

TEGFSAA. . . 

TAGFGDA. . . 

IEGFQQAKLiA 

IEGFQSAKLA 

INGFRKAQLH 

IEGFQSTKLK 

VEGFQTARQD 

IEGFQSAKVL 

NRGFQDAKDR 

TQGFHSALLA 

TTGFYSALSA 



SSNSIT 

. SHHVLN 

.... SGETVL 
DPGA. TNRAA 
DPGSQPHQAS 
DHGS . . KRAT 
DPRAQPGQSS 
DHHANPHQPS 
DPHSDKHDAP 
DPRSNKDQAE 
DRGSTVRPTL 
DKNPPSSLPR 
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T_lanuginosa 
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PKLNLILPQT 
PKLDLILPQT 
PVLSVTISEA 
PILFVILSES 
PTLQWLQEE 
PAISVIIPES 
PVIDVIIPEG 
PWNVIIPEI 
PKIDWTSEA 
PRVDVAIPEG 
PTINVIIEEG 
PVINVIISEE 
PYDMWIPET 
P.EMVIISEE 



G. .NDTLEDN 
G. . NDTLEDN 
G. . NDTXiDDN 
L. . NDTLDDA 
G. . NCTLCNN 
ETFNNTLDHG 

sgynntldhg 
dgfnntldhs 
sssnntldpg 
saynntlehs 
psynntldtg 
tgsnntldgl 

AGANNTTjHND 

ptanntmhhg 



MCPAAGD . . . 
MCPAAGE . . . 
MCPAAGD . . . 
MCPNAGS . . . 
MCPNEVD . . . 
VCTKFEA. . . 
TCTAFED . . . 
TCVSFEN. . . 
TCTVFED. . . 
LCTAFES . . . 
SCPVFED. . . 
TCPAAEE. . . 
LCTAFEEGPY 



. . SDPQVNA 
. . SDPQVDA 
. . SDPQVNQ 
• . SDPQTGI 
. . GD. ESTT 
SQLGDEVAAN 
SELGDDVEAN 
DERADEIEAN 
SELADTVEAN 
STVGDDAVAN 
SSGGHDAQEK 
AP.DPTQPAE 
STIGDDAQDT 
STTGDQAQAE 



217 
250 

WLAVAFPSIT 
WIiASAFPSVT 
WLAQFAPPMT 
WTSIYGTPIA 
WLGVFAPNIT 
FTAIiFAPD IR 
FTALFAPAIR 
FTAIMGPPIR 
FTATFVP S IR 
FTAVFAPAIA 
FAKQFAPAIL 
FLQVFGPRVL 
YLSTFAGPIT 
FIAATFPPIT 



218 252 
251 300 

*Linvoltus_Al ARLNAAAPSV NLTDTDAFNL VSLCAFLTVS KEKK S 

*Linvoltus_A2 AQLNAAAPGA NLTDADAFNL VSLCPFMTVS KEQK s 

Tjpubescens ARLNAGAPGA NLTDTDTYNL LTLCPFETVA TERR s 

A_pediadee NRLNQQAPGA NITAADVSNL IPLCAFETIV KETP S 

P_lycii ARLNAAAPSA NLSDSDALTL MDMCPFDTLS SGNA S 

A_fumigatus ARAEKHLPGV TLTDEDWSL MDMCSFDTVA RTSD. .ASQ LS 

consphyA ARLEADLPGV TLTDEDWYL MDMCPFETVA RTSD. - ATE LS 

A_nidulans KRLENDLPGI KLTNENVIYL MDMCS FDTMA RTAH. .GTE LS 

A_f i euum_NRRL3 135 QRLENDLSGV TLTDTEVTYL MDMCSFDTIS TSTV. .DTK LS 

A_terreus QRLEADLPGV QLSTDDWNL MAMCPFETVS LTDD. . AHT LS 

T_thermo EKIKDHLPGV DLAVSDVPYL MDLCPFETLA RNHT . . DT LS 

T_lanuginosa KKITKHMPGV NLTLEDVPLF MDLCPFDTVG SDPVLFPRQ LS 



M_thermophila ARVNANLPGA NLTDADTVAL MDLCPFETVA SSSSDPATAD AGGGNGRPLS 

ARLNAGFKGV TLSNTDVLSL MDLCPFDTVA YPLSSLTTTS SVSGGGK LS 
Q 



253 
301 

P_ involtus_Al DFCTLFEGIP GSFEAFAYGG 
P_involtUS_A2 DFCTLFEGIP GSFEAFAYAG 
Tjpubescens EFCDIYEELQ AE.DAFAYNA 
Ajpediades PFCNLFT . . P EEFAQFEYFG 
P_lycii PFCDLFT. .A EEYVSYEYYY 
A_fumigatus PFCQLFT. .H NEWKKYNYLQ 



300 
350 

DLDKFYGTGY GQELGPVQGV GYVNELIARL 
DLDKFYGTGY GQALGPVQGV GYINELLARL 
DLDKFYGTGY GQPLGPVQGV GYINELIARL 
DLDKFYGTGY GQPLGPVQGV GYINELLARL 
DLDKYYGTGP GNALGPVQGV GYVNELLARL 
SLGKYYGYGA GNPLGPAQGI GFTNELIARL 



Fig. 17B 
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consphyA 
A_nidulans 
A_f i cuum_NRRL3 135 
A_terreus 
T__thenno 
T_lanuginosa 
M_thermophi 1 a 



P_involtus_Al 
P_invol tus_A2 
T_jpubescens 
A_j>ediades 
P_lycii 
A_fumigatus 
consphyA 
A_jaidulans 
A__f i cuum_NRRL3 13 5 
A_terreus 
T_thermo 
T_lanuginosa 
M_thermophila 



P_involtus_Al 
P_involtus_A2 
T_pubescens 
A_ped±ades 
P_lycii 
A_fumigatus 
consphyA 
A_nidulans 
A_f icuum_NRRL3 13 5 
A_terreus 
T_thermo . 
T_lanugino s a 
M_thermophi 1 a 



P_involtus_Al 
P_involtus_A2 
Tjpubescens 
A_pediades 
P_lycii 
A_f umigatus 
consphyA 
A_nidulans 
A_f i cuum_NRRL3 135 
A_terreus 
T thermo 



PFCALFT . . H DEWRQYDYLQ SLGKYYGYGA GNPLGPAQGV GFANELIARL 
PFCAIFT..E KEWLQYDYLQ SLSKYYGYGA GSPLGPAQGI GFTNEL I ARL 
PFCDLFT..H DEWINYDYLQ SLKKYYGHGA GNPLGPTQGV GYANELIARL 
PFCDLFT . .A TEWTQYNYLL SLDKYYGYGG GNPLGPVQGV GWANELMARL 
PFCALST . . Q EEWQAYDYYQ SLGKYYGNGG GNPLGPAQGV GFVNELIARM 
PFCHLFT. .A DDWMAYDYYY TLDKYYSHGG GSAFGPSRGV GFVNELIARM 
PFCRLFS . . E SEWRAYDYLQ SVGKWYGYGP GNPLGPTQGV GFVNELLARL 
PFCSLFT A SDWTIYDYLQ SLGKYYGFGP GNSLAATQGV GYVNELIARL 

301 349 
351 400 
TNS.AVRDNT QTNRTLDASP VTFPLNKTFY ADFSHDNLMV AVFSAMGLFR 
TNS.AVNDNT QTNRTLDAAP DTFPLNKTMY ADFSHDNLMV AVFSAMGLFR 
TAQ.NVSDHT QTNSTLDSSP ETFPLNRTLY ADFSHDNQMV AIFSAMGLFN 
TEM . P VRDNT QTNRTLDSSP LTFPLDRSIY ADLSHDNQMI AIFSAMGLFN 
TGQ . AVRDET QTNRTLDSDP ATFPLNRTFY ADFSHDNTMV P I FAALGLFN 
TRS-PVQDHT STNSTLVSNP ATFPLNATMY VDFSHDNSMV SIFFALGLYN 
TRS.PVQDHT STNHTLDSNP ATFPLNATLY ADFSHDNSMI SIFFALGLYN 
TQS.PVQDNT STNHTLDSNP ATFPLDRKLY ADFSHDNSMI SIFFAMGLYN 
THS.PVHDDT SSNHTLDSSP ATFPLKSTLY ADFSHDNGI I SILFALGLYN 
TRA- PVHDHT CVNNTLDASP ATFPLNATLY ADFSHDSNLV SIFWALGLYN 
THS . PVQDYT TVNHTLDSNP ATFPLNATLY ADFSHDNTMT SIFAALGLYN 
TGNLPVKDHT TVNHTLDDNP ETFPLDAVLY ADFSHDNTMT GIFSAMGLYN 
A. GVPVRDGT STNRTLDGDP RTFPLGRPLY ADFSHDNDMM GVLGALGAYD 
I RAPWDHT TTNSTLDGDE KTFPLNRTVY ADFSHDNDMM NILTALRIFE 

350 383 

401 450 

QPAPLSTSVP NPWR T WRTSSLVPFS GRMWERLSC 

QSAPLSTSTP DPNR T WLTSSWPFS ARMAVERLSC . . . 

QSAPLDPTTP DPAR T FLVKKIVPFS ARMWERLDC 

QSSPLDPSFP NPKR T WVTSRLTPFS ARMVTERLLC QRDGTGSGGP 

ATA.LDPLKP DENR L WVDSKLVPFS GHMTVEKLAC 

GTEPLSRTSV ESAKE . . LDG YSASWWPFG ARAYFETMQC 

GTAPLSTTSV ESIEE. . TDG YSASWTVPFG ARAYVEMMQC 

GTQPLSMDSV ESIQE. .MDG YAASWTVPFG ARAYFELMQC 

GTKPLSTTTV ENITQ. .TDG FSSAWTVPFA SRLYVEMMQC 

GTAPLSQTSV ESVSQ. .TDG YAAAWTVPFA ARAYVEMMQC 

GTAKLSTTEI KSIEE. .TDG YSAAWTVPFG GRAYIEMMQC 

GTKPLSTSKI QPPTGAAADG YAASWTVPFA ARAYVELLRC ETETS SEEEE 
GVPPLDKTAR RDPEE..LGG YAASWAVPFA ARIYVEKMRC SGGGGGGGGG 
HISPMDQTGD DGVKE RDL FKVSWAVPFA GRVYFEKMVC DADGDGKIDS 
NTTIPTNYG 



FGT TKVRVLVQDQ VQPLEFCGGD RNGLCTLAKF VESQTFARSD 

AGT TKVRVLVQDQ VQPLEFCGGD QDGLCALDKF VESQAYARSG 

GGA QSVRLLVNDA VQPLAFCGAD TSGVCTLDAF VESQAYARND 

SRIMRNGNVQ TFVRILVNDA LQPLKFCGGD MDS LCTLEAF VESQKYARED 

SGK EAVRVLVNDA VQPLEFCGG . VDGVCELSAF VESQTYAREN 

K. .S . . .EKE PLVRALINDR WPLHGCDVD KLGRCKLNDF VKGLSWARSG 
Q. .A. . . EKE PLVRVLVNDR WPLHGCAVD KLGRCKRDDF VEGLS FARSG 

E KKE PLVRVLVNDR WPLHGCAVD KFGRCTLDDW VEGLNFARSG 

Q. .A. . . EQA PLVRVLVNDR WPLHGCPVD ALGRCTRDSF VRGLS FARSG 
R. .A. . .EKE PLVRVLVNDR VMPLHGCPTD KLGRCKRDAF VAGLS FAQAG 
D. .D. . . SDE PWRVLVNDR WPLHGCEVD SLGRCKRDDF VRGLS FARQG 
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T_lanuginosa E..G...EDE PFVRVLVNDR WPLHGCRVD RWGRCRRDEW IKGLTFARQG 
M_thermophila E. .GRQEKDE EMVRVLVNDR VMTLKGCGAD ERGMCTLERF IESMAFARGN 

D EAQK EIiVR I L VNDR VMRLNGCDAD EQGRCGLEKF VESMEFARRG 





426 


439 




501 


514 


P_involtus_Al 


GAGDFEKCFA 


TSA. 


P_involtus_A2 


GAGDFEKCIiA 


TTV. 


Tjsubescens 


GEGDFEKCFA 


T- . . 


A_pediades 


GQGDFEKCFD 


• • • • 


P_lycii 


GQGDFAKCGF 


VPSE 


A^fumigatus 


. . GNWGECFS 


• • • • 


consphyA 


. . GNWAECFA 


* 

• • * 


A_nidulans 


. . GNWKTCFT 




A_f icuum_NRRL3135 


. . GDWAECFA 


• • ■ • 


A_terreus 


. . GNWADCF . 


• • • « 


T_thermo 


. . GNWEGCYA 


ASE. 


T_l anugino s a 


. . GHWDRCF . 


• • • • 


M_thermophi la 


. . GKWDLCFA 
GEWEECFV 
R 


• # « • 



Fig. 17D 
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SEQUENCE LISTING 

<212> DNA 

<213> Cladorrhinum f oecundissimum 
<220> 

<221> intron 
<222> (71) . . (126) 



<220> 
10 <221> CDS 

<222> (20) . . (70) 

<220> 
<221> CDS 
15 <222> (127) . . (1563) 



<220> 

<221> sig_peptide 
<222> (20) . . (64) 

20 

<400> 1 

aagcttgggc aaactcatc atg etc ate ttg atg att cca ctg ttc age tac 52 

Met Leu He Leu Met He Pro Leu Phe Ser Tyx 
1 5 10 

25 

ctg get get get tct ctg tgggttcatc ctttgcccct gtctcgatgt 100 
Leu Ala Ala Ala Ser Leu 

15 



30 taaaatacta aacatatttc accaga cgt gta etc tec cct cag cca gtg tec 153 

Arg Val Leu Ser Pro Gin Pro Val Ser 
20 25 



tgt gac age ccg gag ctt ggt tac caa tgc gac cag cag aca acg cac 201 
35 Cys Asp Ser Pro Glu Leu Gly Tyr Gin Cys Asp Gin Gin Thr Thr His 

30 35 40 



acc tgg ggt caa tac tea ccc ttc ttc tct gtc ccg tea gag ate tec 24 9 

Thr Trp Gly Gin Tyr Ser Pro Phe Phe Ser Val Pro Ser Glu He Ser 

40 45 50 55 

cct tec gtt cct gat ggc tgc cgc etc acc ttc gee caa gtt etc tec 2 97 

Pro Ser Val Pro Asp Gly Cys Arg Leu Thr Phe Ala Gin Val Leu Ser 

60 65 70 



45 



50 



cgc cac ggc gee cgc ttc cca acc ccg ggt aaa gee gec gee ate tec 345 

Arg His Gly Ala Arg Phe Pro Thr Pro Gly Lys Ala Ala Ala He Ser 

? 5 80 85 90 

get gtc etc acc aaa ate aaa acc tct gee acc tgg tac ggt tec gac 3 93 

Ala Val Leu Thr Lys He Lys Thr Ser Ala Thr Trp Tyr Gly Ser Asp 

95 100 105 



ttt cag ttc ate aag aac tac gac tat gta ctt ggc gta gac cac ctg 441 
55 Phe Gin Phe He Lys Asn Tyr Asp Tyr Val Leu Gly Val Asp His Leu 

HO 115 120 



WO 99/49022 



PCT/DK99/00153 



2 



5 



10 



acc gcg ttc ggc gag caa gaa atg gtc aac tec ggc ate aag ttc tac 489 
Thr Ala Phe Gly Glu Gin Glu Met Val Asn Ser Gly lie Lys Phe Tyr 
125 130 135 

cag cgc tac tec tec etc ate cag aca gaa gac teg gat acg etc ccc 537 
Gin Arg Tyr Ser Ser Leu lie Gin Thr Glu Asp Ser Asp Thr Leu Pro 
140 145 150 

ttc gtc cgc gee tct ggc cag gaa cgc gtc ate gee tec gee gag aac 5 85 
Phe Val Arg Ala Ser Gly Gin Glu Arg Val lie Ala Ser Ala Glu Asn 
155 160 165 170 



ttc acc acc ggc ttc tac teg gee etc tea gee gac aag aac cct cct 633 
15 Phe Thr Thr Gly Phe Tyr Ser Ala Leu Ser Ala Asp Lys Asn Pro Pro 

175 180 185 

tec tec tta cca aga cca gaa atg gtc ate att tct gag gag cca aca 681 
Ser Ser Leu Pro Arg Pro Glu Met Val lie lie Ser Glu Glu Pro Thr 
20 190 195 200 

gee aac aac acc atg cac cac ggc etc tgc cgc tec ttt gaa gat tec 729 
Ala Asn Asn Thr Met His His Gly Leu Cys Arg Ser Phe Glu Asp Ser 
205 210 215 

25 

acc acc ggc gac caa gee caa gcg gaa ttc ate gee gee acc ttc cca 777 
Thr Thr Gly Asp Gin Ala Gin Ala Glu Phe He Ala Ala Thr Phe Pro 
220 225 230 

30 ccc ate acc gec cgt etc aac gee caa ggt ttc aaa ggc gtc acc etc 825 
Pro He Thr Ala Arg Leu Asn Ala Gin Gly Phe Lys Gly Val Thr Leu 
235 240 245 250 

tec aac acc gac gtc eta tea eta atg gac etc tgc ccc ttt gac acc 873 
35 Ser Asn Thr Asp Val Leu Ser Leu Met Asp Leu Cys Pro Phe Asp Thr 

255 260 265 

gtc gee tac ccc ctt tec tec etc acc acc acc tct tec gtt tct gga 921 
Val Ala Tyr Pro Leu Ser Ser Leu Thr Thr Thr Ser Ser Val Ser Gly 
40 270 275 280 

ggc ggc aag tta tec ccc ttc tgc tct ctt ttc act gee age gac tgg 969 

Gly Gly Lys Leu Ser Pro Phe Cys Ser Leu Phe Thr Ala Ser Asp Trp 
285 290 295 

45 

aca ate tac gat tac etc cag tec eta ggg aaa tac tac ggt ttc ggc 1017 

Thr He Tyr Asp Tyr Leu Gin Ser Leu Gly Lys Tyr Tyr Gly Phe Gly 
300 305 310 

50 ccc ggt aat tec eta get gee acc cag ggg gta ggg tac gtc aac gag 1065 
Pro Gly Asn Ser Leu Ala Ala Thr Gin Gly Val Gly Tyr Val Asn Glu 
315 320 325 330 

ctt ate gee cgc ttg ate cgt get ccc gtc gta gat cac acg acg acc 1113 
55 Leu He Ala Arg Leu He Arg Ala Pro Val Val Asp His Thr Thr Thr 

335 340 345 
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aac tct act ctt gat ggc gac gaa aaa acg ttt ccg ttg aac aga acg 1161 

Asn Ser Thr Leu Asp Gly Asp Glu Lys Thr Phe Pro Leu Asn Arg Thr 

350 355 360 

5 

gtg tat gcg gat ttt tec cat gat aat gat atg atg aat ate ctg act 1209 

Val Tyr Ala Asp Phe Ser His Asp Asn Asp Met Met Asn lie Leu Thr 

365 370 375 

10 get ttg egg ata ttc gag cat ate agt ccg atg gat aac acc act ate 1257 
Ala Leu Arg lie Phe Glu His lie Ser Pro Met Asp Asn Thr Thr lie 
380 385 390 

ccg acc aac tat ggc cag aca gga gat gac ggg gtg aag gaa agg gat 13 05 
15 Pro Thr Asn Tyr Gly Gin Thr Gly Asp Asp Gly Val Lys Glu Arg Asp 
395 400 405 410 

ttg ttc aag gtt agt tgg gcg gtg ccc ttt get ggg agg gtg tac ttt 13 53 
Leu Phe Lys Val Ser Trp Ala Val Pro Phe Ala Gly Arg Val Tyr Phe 
20 415 420 425 

gag aaa atg gtt tgt gat gcg gat ggg gat ggc aag att gat agt gat 14 01 
Glu Lys Met Val Cys Asp Ala Asp Gly Asp Gly Lys lie Asp Ser Asp 

430 435 440 

25 

gag get cag aaa gag ttg gtg agg att ttg gtt aat gat egg gtg atg 1449 
Glu Ala Gin Lys Glu Leu Val Arg lie Leu Val Asn Asp Arg Val Met 
445 450 455 

30 aga ttg aat ggg tgt gat get gat gaa cag ggt agg tgt gga ttg gag 14 97 
Arg Leu Asn Gly Cys Asp Ala Asp Glu Gin Gly Arg Cys Gly Leu Glu 
460 465 470 

aag ttt gtg gag agt atg gag ttt gcg agg aga ggg ggg gag tgg gag 1545 
35 Lys Phe Val Glu Ser Met Glu Phe Ala Arg Arg Gly Gly Glu Trp Glu 
475 480 485 490 

gag agg tgt ttt gtt tag ctctaga 1570 
Glu Arg Cys Phe Val 
40 495 



<210> 2 

<211> 495 

45 <212> PRT 

<213> Cladorrhinum f oecundissimum 

<400> 2 

Met Leu lie Leu Met lie Pro Leu Phe Ser Tyr Leu Ala Ala Ala Ser 
50 1 5 10 15 

Leu Arg Val Leu Ser Pro Gin Pro Val Ser Cys Asp Ser Pro Glu Leu 

20 25 30 



55 Gly Tyr Gin Cys Asp Gin Gin Thr Thr His Thr Trp 

35 40 



Gly Gin 
45 



Tyr Ser 
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Pro Phe Phe Ser Val Pro Ser Glu lie Ser Pro Ser Val Pro Asp Gly 
50 55 60 

5 Cys Arg Leu Thr Phe Ala Gin Val Leu Ser Arg His Gly Ala Arg Phe 
65 70 75 80 

Pro Thr Pro Gly Lys Ala Ala Ala He Ser Ala Val Leu Thr Lys He 

85 90 95 

10 

Lys Thr Ser Ala Thr Trp Tyr Gly Ser Asp Phe Gin Phe He Lys Asn 

100 105 110 

Tyr Asp Tyr Val Leu Gly Val Asp His Leu Thr Ala Phe Gly Glu Gin 
15 115 120 125 

Glu Met Val Asn Ser Gly He Lys Phe Tyr Gin Arg Tyr Ser Ser Leu 
130 135 140 

20 He Gin Thr Glu Asp Ser Asp Thr Leu Pro Phe Val Arg Ala Ser Gly 
145 150 155 160 

Gin Glu Arg Val He Ala Ser Ala Glu Asn Phe Thr Thr Gly Phe Tyr 

165 170 175 

25 

Ser Ala Leu Ser Ala Asp Lys Asn Pro Pro Ser Ser Leu Pro Arg Pro 

180 185 190 

Glu Met Val He He Ser Glu Glu Pro Thr Ala Asn Asn Thr Met His 
30 195 200 205 

His Gly Leu Cys Arg Ser Phe Glu Asp Ser Thr Thr Gly Asp Gin Ala 
210 215 220 

35 Gin Ala Glu Phe He Ala Ala Thr Phe Pro Pro He Thr Ala Arg Leu 
225 230 235 240 

Asn Ala Gin Gly Phe Lys Gly Val Thr Leu Ser Asn Thr Asp Val Leu 

245 250 255 

40 

Ser Leu Met Asp Leu Cys Pro Phe Asp Thr Val Ala Tyr Pro Leu Ser 

260 265 270 

Ser Leu Thr Thr Thr Ser Ser Val Ser Gly Gly Gly Lys Leu Ser Pro 
45 275 280 285 

Phe Cys Ser Leu Phe Thr Ala Ser Asp Trp Thr He Tyr Asp Tyr Leu 
290 295 300 

50 Gin Ser Leu Gly Lys Tyr Tyr Gly Phe Gly Pro Gly Asn Ser Leu Ala 
305 310 315 320 

Ala Thr Gin Gly Val Gly Tyr Val Asn Glu Leu He Ala Arg Leu He 

325 330 335 

55 

Arg Ala Pro Val Val Asp His Thr Thr Thr Asn Ser Thr Leu Asp Gly 
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340 

Asp Glu Lys Thr Phe 
355 

5 

His Asp Asn Asp Met 
370 

His lie Ser Pro Met 
10 385 

Thr Gly Asp Asp Gly 

405 

15 Ala Val Pro Phe Ala 

420 

Ala Asp Gly Asp Gly 
435 

20 

Val Arg lie Leu Val 
450 

Ala Asp Glu Gin Gly 
25 465 

Glu Phe Ala Arg Arg 

485 



5 

345 

Pro Leu Asn Arg Thr 
360 

Met Asn lie Leu Thr 
375 

Asp Asn Thr Thr lie 
390 

Val Lys Glu Arg Asp 

410 

Gly Arg Val Tyr Phe 

425 

Lys lie Asp Ser Asp 
440 

Asn Asp Arg Val Met 
455 

Arg Cys Gly Leu Glu 
470 

Gly Gly Glu Trp Glu 

490 



350 

Val Tyr Ala Asp Phe Ser 
365 

Ala Leu Arg lie Phe Glu 
380 

Pro Thr Asn Tyr Gly Gin 
395 400 

Leu Phe Lys Val Ser Trp 

415 

Glu Lys Met Val Cys Asp 

430 

Glu Ala Gin Lys Glu Leu 
445 

Arg Leu Asn Gly Cys Asp 
460 

Lys Phe Val Glu Ser Met 
475 480 

Glu Arg Cys Phe Val 

495 
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